


WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)



(51) International Patent Classification: 

C12Q 1/68 


A1 


(11) International Publication Number: WO 00/06776 
(43) International Publication Date: 10 February 2000 (10.02.2000) 


(21) International Application Number PCT/US99/16675 

(22) International Filing Date: 22 July 1999 (22.07.1999)

(30) Priority Data: 

60/094,391 28 July 1998(28.07.1998) US 


Published 



(54) Title: GENOTYPING HUMAN UDP-GLUCURONOSYLTRANSFERASE 2B4 (UGT2B4), 2B7 (UGT2B7) AND 2B15 
(UGT2B15) GENES 




(57) Abstract 

Genetic polymorphisms are identified in the human UGT2B4, UGT2B7 and UGT2B15 genes that alter UGT2B activity. Nucleic 
acids comprising the polymorphic sequences are used to screen patients for altered metabolism for UGT2B substrates, potential 
drug-drug interactions, and adverse/side effects, as well as diseases that result from environmental or occupational exposure to 
toxins. The nucleic acids are used to establish animal, cell and in vitro models for drug metabolism. 







(71) Applicant (for all designated States except US): AXYS PHARMACEUTICALS, INC.
[US/US]; 180 Kimball Way, South San Francisco, CA 94080 (US).

(72) Inventors; and

(75) Inventors/Applicants (for US only): GALVIN, Margaret
[US/US]; 7768 Cone Promenade, Carlsbad, CA 92009
(US). MILLER, Andrew [US/US]; 2131 Old Stone Mill
Drive, Cranbury, NJ 08512 (US). PENNY, Laura [US/US];
3903 Falcon Street, San Diego, CA 92103 (US). RIEDY,
Michael [US/US]; 4066 Gresham Street #B, San Diego,
CA 92109 (US).

(74) Agent: SHERWOOD, Pamela, J.; Bozicevic, Field & Francis
LLP, Suite 200, 285 Hamilton Avenue, Palo Alto, CA 94301
(US).



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG,
BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB,
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG,
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK,
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI,
SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZA,
ZW, ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ,
UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD,
RU, TJ, TM), European patent (AT, BE, CH, CY, DE, DK,
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI
patent (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR,
NE, SN, TD, TG).



Published 

With international search report. 








Description 




WO 00/06776 PCT/US99/16675 
GENOTYPING HUMAN UDP-GLUCURONOSYLTRANSFERASE 2B4 (UGT2B4), 2B7 
(UGT2B7) and 2B15 (UGT2B15) GENES 



Introduction 

The metabolic processes commonly involved in the biotransformation of xenobiotics
have been classified into functionalization reactions (phase I reactions), in which lipophilic
compounds are modified via monooxygenation, dealkylation, reduction, aromatization, or
hydrolysis. These modified molecules can then be substrates for the phase II reactions,
often called conjugation reactions, as they conjugate a functional group with a polar,
endogenous compound. Drug glucuronidation, a major phase II conjugation reaction in the
mammalian detoxification system, is catalyzed by the UDP-glucuronosyltransferases (UGTs)
(Batt AM, et al. (1994) Clin Chim Acta 226:171-190; Burchell et al. (1995) Life Sci. 57:1819-31).

The UGTs are a family of enzymes that catalyze the glucuronic acid conjugation of
a wide range of endogenous and exogenous substrates including phenols, alcohols, amines
and fatty acids. The reactions catalyzed by UGTs permit the conversion of a large range of
toxic endogenous/xenobiotic compounds to more water-soluble forms for subsequent
excretion (Parkinson A (1996) Toxicol Pathol 24:48-57).

The UGT isoenzymes are located primarily in hepatic endoplasmic reticulum and
nuclear envelope (Parkinson A (1996) Toxicol Pathol 24:48-57), though they are also
expressed in other tissues such as kidney and skin. UGTs are encoded by a large multigene
superfamily that has evolved to produce catalysts with differing but overlapping substrate
specificities. Three families, UGT1, UGT2, and UGT8, have been identified within the
superfamily. UGTs are assigned to one of the subfamilies based on amino acid sequence
identity, e.g., UGT1 family members have greater than 45% amino acid sequence identity
(Mackenzie et al. (1997) Pharmacogenetics 7:255-69).
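
The identity criterion above can be illustrated with a short sketch. It computes percent amino acid identity over a pairwise alignment and compares it with the 45% figure quoted for UGT1 family members. The function names, the gap convention (columns containing a gap are skipped) and the toy sequences are assumptions made for illustration only; they are not taken from the cited nomenclature.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Both inputs must already be aligned (equal length, '-' marks a gap).
    Skipping gapped columns is an assumed convention, not the only possible one.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("alignment has no ungapped columns")
    return 100.0 * matches / compared


def exceeds_identity_threshold(aligned_a: str, aligned_b: str,
                               threshold: float = 45.0) -> bool:
    """True when identity is greater than the threshold (45% in the text above)."""
    return percent_identity(aligned_a, aligned_b) > threshold


# Toy, hypothetical aligned fragments (not real UGT sequences).
identity = percent_identity("MKLV-A", "MKIVGA")
```

With the toy fragments, four of the five ungapped columns match, so `identity` is 80.0.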

A single gene encodes several human UGT1 isoforms, the substrate specificity of
each of which is thought to arise from differential splicing of a number of substrate-specific
5-prime regions of a single mRNA transcript to a shared 3-prime portion. On the other hand,
members of the mammalian UGT2 gene subfamily, which encode the odorant and
steroid-metabolizing isoforms, show nucleotide differences in sequence throughout the
length of the cDNAs. This suggested that the UGT2 isoenzymes are encoded by several
independent genes. The UGT2 genes have been further divided on the basis of their
tissue-specific expression patterns into the UGT2A gene subfamily, which encodes
olfactory-specific isoforms, and the UGT2B gene subfamily, which encodes
steroid-metabolizing isoforms in the liver. Monaghan et al. (1994) Genomics 23:496-499 
mapped the UGT2B9 and the UGT2B15 genes to chromosome 4q13, giving a provisional 
ordering of the genes as UGT2B9-UGT2B4-UGT2B15. The UGT2B subfamily contains 
phenobarbital-inducible genes, as well as numerous genes that are constitutively expressed 
and are involved in the glucuronidation of endogenous steroids and biogenic amines 
(Mackenzie et al., supra). Evidence suggests that UGT2B4 is exclusively expressed in
human liver, and not in human kidney. Levesque et al. (1997) Pharmacogenetics 7:317; and
Coffman et al. (1997) Drug Metabol. and Dispos. 25:1-4, describe UGT2B gene
polymorphisms. 

Alteration of the expression or function of UGTs may affect drug metabolism. For 
example, there may be common polymorphisms in the human UGT2B gene that alter 
expression or function of the protein product and cause drug exposure-related phenotypes. 
Thus, there is a need in the field to identify UGT2B polymorphisms in order to provide a 
better understanding of drug metabolism and the diagnosis of drug exposure-related 
phenotypes. 

Summary of the invention 

Genetic sequence polymorphisms are identified in the UGT2B4, UGT2B7 and 
UGT2B15 genes, herein generically referred to as "UGT2B genes". Nucleic acids 
comprising the polymorphic sequences are used in screening assays, and for genotyping 
individuals. The genotyping information is used to predict an individual's rate of metabolism
for UGT2B substrates, potential drug-drug interactions, and adverse/side effects. Specific 
polynucleotides include the polymorphic UGT2B4 sequences set forth in SEQ ID NOs:25-38; 
the polymorphic UGT2B7 sequences set forth in SEQ ID NOs:84-111; and the polymorphic 
UGT2B15 sequences set forth in SEQ ID NOs:147-164. 

The nucleic acid sequences of the invention may be provided as probes for detection 
of UGT2B locus polymorphisms, where the probe comprises a polymorphic sequence of 
SEQ ID NOs:25-38; 84-111 and 147-164. The sequences may further be utilized as an
array of oligonucleotides comprising two or more probes for detection of UGT2B locus 
polymorphisms. 

Another aspect of the invention provides a method for detecting in an individual a 
polymorphism in UGT2B metabolism of a substrate, where the method comprises analyzing 
the genome of the individual for the presence of at least one UGT2B polymorphism; wherein
the presence of the predisposing polymorphism is indicative of an alteration in UGT2B
expression or activity. The analyzing step of the method may be accomplished by detection
of specific binding between the individual's genomic DNA with an array of oligonucleotides
comprising UGT2B locus polymorphic sequences. In other embodiments, the alteration in
UGT2B expression or activity is tissue specific, or is in response to a UGT2B modifier that
induces or inhibits UGT2B expression. 

Brief Description of the Sequence Listing 
UGT2B Reference Sequences. SEQ ID NOs:1-6 list the sequence of the reference
UGT2B4 exons, where exon 1 is SEQ ID NO:1, exon 2 is SEQ ID NO:2 and so forth. Partial
sequence of the flanking introns is included; the boundaries are annotated in the SEQLIST.
The cDNA sequence is set forth in SEQ ID NO:7, and the encoded amino acid sequence 
in SEQ ID NO:8. 

SEQ ID NO:39 lists the sequence of the UGT2B7 cDNA sequence, the encoded 
polypeptide is provided in SEQ ID NO:40. SEQ ID NOs: 41-45 list the sequence of the 
reference UGT2B7 exons, where exon 1 is SEQ ID NO:41, exon 2 is SEQ ID NO:42 and so 
forth. Partial sequence of the flanking introns is included; the boundaries are annotated in 
the SEQLIST. 

SEQ ID NO:112 lists the sequence of the UGT2B15 cDNA sequence, the encoded 
polypeptide is provided in SEQ ID NO: 113. SEQ ID NOs:114-118 list the sequence of the 
reference UGT2B15 exons, where exon 1 is SEQ ID NO:114, exon 2 is SEQ ID NO:115 and 
so forth. Partial sequence of the flanking introns is included; the boundaries are annotated 
in the SEQLIST. 

Primers. The PCR primers for amplification of polymorphic sequences are set forth 
as SEQ ID NOs:9-14; 46-66; and 135-146. The primers used in sequencing isolated 
polymorphic sequences are presented as SEQ ID NOs:15-24; 67-83; and 119-134. 

Polymorphisms. Polymorphic sequences of UGT2B4 are presented as SEQ ID 
NOs:25-38. Polymorphic sequences of UGT2B7 are presented as SEQ ID NOs:84-111. 
Polymorphic sequences of UGT2B15 are presented as SEQ ID NOs:147-164.
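
The numbering scheme of the sequence listing described above can be summarized as a small lookup table. The sketch below is illustrative only: the SEQ ID ranges are those stated in this section, while the assignment of each primer range to a particular gene is inferred from its position in the numbering and is an assumption, as are the names `SEQ_ID_RANGES` and `describe_seq_id`.

```python
# (first SEQ ID, last SEQ ID, gene, kind). Ranges are taken from the text above.
# The gene given for each primer range is an inference from the numbering order.
SEQ_ID_RANGES = [
    (1, 6, "UGT2B4", "reference exon"),
    (7, 7, "UGT2B4", "cDNA"),
    (8, 8, "UGT2B4", "polypeptide"),
    (9, 14, "UGT2B4", "PCR primer"),
    (15, 24, "UGT2B4", "sequencing primer"),
    (25, 38, "UGT2B4", "polymorphism"),
    (39, 39, "UGT2B7", "cDNA"),
    (40, 40, "UGT2B7", "polypeptide"),
    (41, 45, "UGT2B7", "reference exon"),
    (46, 66, "UGT2B7", "PCR primer"),
    (67, 83, "UGT2B7", "sequencing primer"),
    (84, 111, "UGT2B7", "polymorphism"),
    (112, 112, "UGT2B15", "cDNA"),
    (113, 113, "UGT2B15", "polypeptide"),
    (114, 118, "UGT2B15", "reference exon"),
    (119, 134, "UGT2B15", "sequencing primer"),
    (135, 146, "UGT2B15", "PCR primer"),
    (147, 164, "UGT2B15", "polymorphism"),
]


def describe_seq_id(seq_id: int) -> tuple:
    """Return (gene, kind) for a SEQ ID NO, or raise KeyError if it is not listed."""
    for first, last, gene, kind in SEQ_ID_RANGES:
        if first <= seq_id <= last:
            return gene, kind
    raise KeyError(f"SEQ ID NO:{seq_id} is not described in this section")


example = describe_seq_id(37)  # ("UGT2B4", "polymorphism")
```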

Description of the Specific Embodiments
Pharmacogenetics is the association between an individual's genotype and that 
individual's ability to metabolize or react to a therapeutic agent. Differences in metabolism 
or target sensitivity can lead to severe toxicity or therapeutic failure by altering the relation 
between bioactive dose and blood concentration of the drug. Relationships between 
polymorphisms in metabolic enzymes or drug targets and both response and toxicity can be 
used to optimize therapeutic dose administration. 

Genetic polymorphisms are identified in the UGT2B4, UGT2B7 and UGT2B15 genes. 
Nucleic acids comprising the polymorphic sequences are used to screen patients for altered 
metabolism for UGT2B substrates, potential drug-drug interactions, and adverse/side effects, 
as well as diseases that result from environmental or occupational exposure to toxins. The 
nucleic acids are used to establish animal, cell culture and in vitro cell-free models for drug 
metabolism. 

Definitions 

It is to be understood that this invention is not limited to the particular methodology, 
protocols, cell lines, animal species or genera, constructs, and reagents described, as such 
may vary. It is also to be understood that the terminology used herein is for the purpose of 
describing particular embodiments only, and is not intended to limit the scope of the present 
invention which will be limited only by the appended claims. 

As used herein the singular forms "a", "and", and "the" include plural referents unless 
the context clearly dictates otherwise. Thus, for example, reference to "a construct" includes 
a plurality of such constructs and reference to "the UGT2B nucleic acid" includes reference 
to one or more nucleic acids and equivalents thereof known to those skilled in the art, and 
so forth. All technical and scientific terms used herein have the same meaning as commonly 
understood to one of ordinary skill in the art to which this invention belongs unless clearly 
indicated otherwise. 

UGT2B4 reference sequence. The sequence of human UGT2B4 cDNA may be 
accessed through Genbank, accession number Y00317, and is provided in SEQ ID NOs:1-7.
The amino acid sequence of UGT2B4 is listed as SEQ ID NO:8. The sequence of human
UGT2B7 may be accessed through Genbank, accession number 600068, and in the
SEQLIST as described above. The sequence of human UGT2B15 may be accessed
through Genbank, accession number 600069, and in the SEQLIST as described above. The
nucleotide sequences provided herein differ from the published sequence at certain positions
throughout the sequence. Where there is a discrepancy the provided sequence is used as 
a reference. 

The term "wild-type" may be used to refer to the reference coding sequences of
UGT2B4, UGT2B7 and UGT2B15, and the term "variant", or "UGT2Bv", to refer to the
provided variations in the UGT2B sequences. The UGT2B4, UGT2B7 and UGT2B15
sequences are generically referred to as "UGT2B", and may be further distinguished by the
species, e.g. human, mouse, etc., or by the specific gene number, e.g. UGT2B4, UGT2B7,
etc. Where there is no published form, such as in the intron sequences, the term wild-type
may be used to refer to the most commonly found allele. It will be understood by one of skill
in the art that the designation as "wild-type" is merely a convenient label for a common allele,
and should not be construed as conferring any particular property on that form of the 
sequence. 

UGT2B polymorphic sequences. It has been found that specific sites in the 
UGT2B4, UGT2B7 and UGT2B15 gene sequences are polymorphic, i.e. within a population,
more than one nucleotide (G, A, T, C) is found at a specific position. Polymorphisms may 
provide functional differences in the genetic sequence, through changes in the encoded 
polypeptide, changes in mRNA stability, binding of transcriptional and translation factors to 
the DNA or RNA, and the like. The polymorphisms are also used as single nucleotide 
polymorphisms (SNPs) to detect genetic linkage to phenotypic variation in activity and 
expression of the particular UGT2B protein. 

SNPs are generally biallelic systems, that is, there are two alleles that an individual 
may have for any particular marker. SNPs, found approximately every kilobase, offer the 
potential for generating very high density genetic maps, which will be extremely useful for 
developing haplotyping systems for genes or regions of interest, and because of the nature 
of SNPs, they may in fact be the polymorphisms associated with the disease phenotypes 
under study. The low mutation rate of SNPs also makes them excellent markers for studying 
complex genetic traits. 

SNPs are provided in the UGT2B4, UGT2B7 and UGT2B15 intron and exon 
sequences. Tables 4, 7 and 10, and the corresponding sequence listing, provide both forms 
of each polymorphic sequence. For example, SEQ ID NO:37 and 38 are the alternative 
forms of a single polymorphic site. The provided sequences also encompass the 
complementary sequence corresponding to any of the provided polymorphisms.

In order to provide an unambiguous identification of the specific site of a
polymorphism, sequences flanking the polymorphic site are shown in the tables, where the
5' and 3' flanking sequence is non-polymorphic, and the central position, shown in bold, is
variable. It will be understood that there is no special significance to the length of
non-polymorphic flanking sequence that is included, except to aid in positioning the
polymorphism in the genomic sequence. The UGT2B exon sequences have been published,
and therefore one of each pair of the sequences from exons in Tables 4, 7 and 10 is
publicly known sequence. The intron sequence has not been published, and hence both
forms of each intron polymorphic sequence are novel.
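
As a minimal sketch of the representation described above, a polymorphic site can be modeled as a non-polymorphic 5' flank, a variable central position with two alternative nucleotides, and a non-polymorphic 3' flank. Both forms, and their complementary (reverse-complement) sequences, are then derived from that record. The class name, field names and the example sequence are hypothetical; they do not reproduce any entry of Tables 4, 7 or 10.

```python
from dataclasses import dataclass
from typing import Tuple

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


@dataclass(frozen=True)
class PolymorphicSite:
    five_prime: str            # non-polymorphic 5' flanking sequence
    alleles: Tuple[str, str]   # the two nucleotides found at the variable position
    three_prime: str           # non-polymorphic 3' flanking sequence

    def forms(self) -> Tuple[str, ...]:
        """Both forms of the polymorphic sequence, one per allele."""
        return tuple(self.five_prime + a + self.three_prime for a in self.alleles)

    def complementary_forms(self) -> Tuple[str, ...]:
        """Reverse complements of both forms."""
        return tuple(reverse_complement(f) for f in self.forms())


# Hypothetical example site, for illustration only.
site = PolymorphicSite(five_prime="ACGT", alleles=("A", "G"), three_prime="TTCA")
```

Here `site.forms()` yields the two alternative sequences `ACGTATTCA` and `ACGTGTTCA`, which play the role of a pair such as SEQ ID NO:37 and 38.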

As used herein, the term "UGT2B4, UGT2B7 and UGT2B15 genes" is intended to
generically refer to both the wild-type and variant forms of the sequence, unless specifically 
denoted otherwise. As it is commonly used in the art, the term "gene" is intended to refer 
to the genomic region encompassing the 5' UTR, exons, introns, and the 3' UTR. Individual 
segments may be specifically referred to, e.g. exon 2, intron 5, etc. Combinations of such 
segments that provide for a complete UGT2B protein may be referred to generically as a 
protein coding sequence. 

Nucleic acids of interest comprise the provided UGT2Bv nucleic acid sequence(s),
as set forth in Tables 4, 7 and 10. Such nucleic acids include short hybridization probes,
protein coding sequences, variant forms of UGT2B cDNA, segments, e.g. exons, introns,
etc., and the like. Methods of producing nucleic acids are well-known in the art, including
chemical synthesis, cDNA or genomic cloning, PCR amplification, etc.

For the most part, DNA fragments will be of at least 15 nt, usually at least 20 nt, often
at least 50 nt. Such small DNA fragments are useful as primers for PCR, hybridization
screening, etc. Larger DNA fragments, i.e. greater than 100 nt, are useful for production of
the encoded polypeptide, promoter motifs, etc. For use in amplification reactions, such as 
PCR, a pair of primers will be used. The exact composition of primer sequences is not 
critical to the invention, but for most applications the primers will hybridize to the subject 
sequence under stringent conditions, as known in the art. 

The UGT2B nucleic acid sequences are isolated and obtained in substantial purity, 
generally as other than an intact or naturally occurring mammalian chromosome. Usually, 
the DNA will be obtained substantially free of other nucleic acid sequences that do not 
include a UGT2B sequence or fragment thereof, generally being at least about 50%, usually 
at least about 90% pure and are typically "recombinant", i.e. flanked by one or more 
nucleotides with which it is not normally associated on a naturally occurring chromosome.

For screening purposes, hybridization probes of the polymorphic sequences may be
used where both forms are present, either in separate reactions, spatially separated on a
solid phase matrix, or labeled such that they can be distinguished from each other. Assays
may utilize nucleic acids that hybridize to one or more of the described polymorphisms.

An array may include all or a subset of the polymorphisms listed in Tables 4, 7 and
10. One or both polymorphic forms may be present in the array, for example the
polymorphism of SEQ ID NO:37 and 38 may be represented by either, or both, of the listed
sequences. Usually such an array will include at least 2 different polymorphic sequences,
i.e. polymorphisms located at unique positions within the locus, and may include all of the
provided polymorphisms. Arrays of interest may further comprise sequences, including
polymorphisms, of other genetic sequences, particularly other sequences of interest for
pharmacogenetic screening, e.g. UGT1, other UGT2 sequences, cytochrome oxidase
polymorphisms, etc. The oligonucleotide sequence on the array will usually be at least about
12 nt in length, may be the length of the provided polymorphic sequences, or may extend
into the flanking regions to generate fragments of 100 to 200 nt in length. For examples of
arrays, see Ramsay (1998) Nat. Biotech. 16:40-44; Hacia et al. (1996) Nature Genetics
14:441-447; Lockhart et al. (1996) Nature Biotechnol. 14:1675-1680; and De Risi et al.
(1996) Nature Genetics 14:457-460.
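
The probe lengths discussed above can be illustrated with a sketch that builds one oligonucleotide per allele, with the polymorphic position placed at or near the center and the remainder taken from the flanking sequence. This is not the method of any cited array reference; the function name, the centering rule, the single-nucleotide allele assumption and the hard minimum of 12 nt (the text says "at least about 12 nt") are assumptions for illustration, and the flanks shown are invented.

```python
def allele_probes(five_prime: str, alleles, three_prime: str, length: int = 15) -> dict:
    """Build one probe per allele, centered on the polymorphic position.

    five_prime / three_prime: non-polymorphic flanking sequences.
    alleles: iterable of single nucleotides found at the variable position.
    length: total probe length in nt; 12 is used here as an assumed lower bound.
    """
    if length < 12:
        raise ValueError("probe length below the assumed 12 nt minimum")
    left = (length - 1) // 2          # bases taken from the 5' flank
    right = length - 1 - left         # bases taken from the 3' flank
    if left > len(five_prime) or right > len(three_prime):
        raise ValueError("flanking sequence too short for the requested length")
    left_flank = five_prime[len(five_prime) - left:]
    right_flank = three_prime[:right]
    return {allele: left_flank + allele + right_flank for allele in alleles}


# Hypothetical flanks, for illustration only.
probes = allele_probes("AAAAACCCCC", ("A", "T"), "GGGGGTTTTT", length=13)
```

The two resulting probes differ only at the central position, so both forms of a site can be placed at separate positions on a solid phase matrix.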

Nucleic acids may be naturally occurring, e.g. DNA or RNA, or may be synthetic
analogs, as known in the art. Such analogs may be preferred for use as probes because
of superior stability under assay conditions. Modifications in the native structure, including
alterations in the backbone, sugars or heterocyclic bases, have been shown to increase
intracellular stability and binding affinity. Among useful changes in the backbone chemistry
are phosphorothioates; phosphorodithioates, where both of the non-bridging oxygens are
substituted with sulfur; phosphoroamidites; alkyl phosphotriesters and boranophosphates.
Achiral phosphate derivatives include 3'-O-5'-S-phosphorothioate,
3'-S-5'-O-phosphorothioate, 3'-CH2-5'-O-phosphonate and 3'-NH-5'-O-phosphoroamidate. Peptide
nucleic acids replace the entire ribose phosphodiester backbone with a peptide linkage.

Sugar modifications are also used to enhance stability and affinity. The α-anomer
of deoxyribose may be used, where the base is inverted with respect to the natural
β-anomer. The 2'-OH of the ribose sugar may be altered to form 2'-O-methyl or 2'-O-allyl
sugars, which provides resistance to degradation without compromising affinity.

Modification of the heterocyclic bases must maintain proper base pairing. Some
useful substitutions include deoxyuridine for deoxythymidine; 5-methyl-2'-deoxycytidine and
5-bromo-2'-deoxycytidine for deoxycytidine. 5-propynyl-2'-deoxyuridine and
5-propynyl-2'-deoxycytidine have been shown to increase affinity and biological activity when
substituted for deoxythymidine and deoxycytidine, respectively.

UGT2B polypeptides. A subset of the provided nucleic acid polymorphisms in
UGT2B exons confers a change in the corresponding amino acid sequence. Using the amino
acid sequence provided in SEQ ID NO:8 as a reference for UGT2B4, the amino acid
polymorphisms of the invention include lys->asn, pos. 40; and glu->asp, pos. 454. Using the
amino acid sequence provided in SEQ ID NO:40 as a reference for UGT2B7, the amino acid
polymorphisms of the invention include leu->phe, pos. 107; thr->ile, pos. 179; and lys->gln,
pos. 430. Using the amino acid sequence provided in SEQ ID NO:125 as a reference for
UGT2B15, the amino acid polymorphisms of the invention include ser->gly, pos. 15;
asp->tyr, pos. 85; leu->pro, pos. 170; his->gln, pos. 282; ala->val, pos. 398; val->ile, pos.
443; and thr->lys, pos. 523.
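
The substitution notation used above (reference residue, variant residue, 1-based position) can be applied to a reference polypeptide as sketched below. The parser, the function names and the five-residue toy peptide are assumptions for illustration; the actual reference sequences are those of the sequence listing and are not reproduced here.

```python
import re

THREE_TO_ONE = {
    "ala": "A", "arg": "R", "asn": "N", "asp": "D", "cys": "C",
    "gln": "Q", "glu": "E", "gly": "G", "his": "H", "ile": "I",
    "leu": "L", "lys": "K", "met": "M", "phe": "F", "pro": "P",
    "ser": "S", "thr": "T", "trp": "W", "tyr": "Y", "val": "V",
}

_NOTATION = re.compile(r"^\s*([A-Za-z]{3})\s*->\s*([A-Za-z]{3}),\s*pos\.\s*(\d+)\s*$")


def parse_substitution(text: str) -> tuple:
    """Parse e.g. 'lys->asn, pos. 40' into ('lys', 'asn', 40)."""
    match = _NOTATION.match(text)
    if match is None:
        raise ValueError(f"unrecognized substitution notation: {text!r}")
    ref, alt, pos = match.groups()
    return ref.lower(), alt.lower(), int(pos)


def apply_substitution(protein: str, text: str) -> str:
    """Return the variant polypeptide; positions are 1-based as in the text."""
    ref, alt, pos = parse_substitution(text)
    if not 1 <= pos <= len(protein):
        raise ValueError("position lies outside the polypeptide")
    if protein[pos - 1] != THREE_TO_ONE[ref]:
        raise ValueError("reference residue does not match the polypeptide")
    return protein[:pos - 1] + THREE_TO_ONE[alt] + protein[pos:]


# Toy peptide (hypothetical), with lysine at position 2.
variant = apply_substitution("MKTAY", "lys->asn, pos. 2")
```

Checking the reference residue before substituting guards against applying a polymorphism to the wrong reference sequence or with an off-by-one position.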

Polypeptides comprising at least one of the provided polymorphisms (UGT2Bv
polypeptides) are of interest. The term "UGT2Bv polypeptides" as used herein includes
complete UGT2B protein forms, e.g. such splicing variants as known in the art, and
fragments thereof, which fragments may comprise short polypeptides, epitopes, functional
domains; binding sites; etc.; and including fusions of the subject polypeptides to other
proteins or parts thereof. Polypeptides will usually be at least about 8 amino acids in length, 
more usually at least about 12 amino acids in length, and may be 20 amino acids or longer, 
up to substantially the complete protein. 

The UGT2B4, UGT2B7 and UGT2B15 genetic sequences, including polymorphisms, 
may be employed for polypeptide synthesis. For expression, an expression cassette may 
be employed, providing for a transcriptional and translational initiation region, which may be 
inducible or constitutive, where the coding region is operably linked under the transcriptional 
control of the transcriptional initiation region, and a transcriptional and translational 
termination region. Various transcriptional initiation regions may be employed that are 
functional in the expression host. The polypeptides may be expressed in prokaryotes or 
eukaryotes in accordance with conventional ways, depending upon the purpose for 
expression. Small peptides can also be prepared by chemical synthesis.

Substrate. A substrate is a chemical entity that is modified by UGT2B4, UGT2B7 or
UGT2B15, usually under normal physiological conditions. Although the duration of drug
action tends to be shortened by metabolic transformation, drug metabolism is not
"detoxification". Frequently the metabolic product has greater biologic activity than the drug
itself. In some cases the desirable pharmacologic actions are entirely attributable to
metabolites, the administered drugs themselves being inert. Likewise, the toxic side effects
of some drugs may be due in whole or in part to metabolic products. 

Substrates can be either endogenous substrates, i.e. substrates normally found 
within the natural environment of UGT2B, such as estriol, or exogenous, i.e. substrates that 
are not normally found within the natural environment of UGT2B. UGT2B catalyzes 
glucuronidation of its substrates. The enzymes are specific for UDP-glucuronic acid, and not 
other UDP sugars. 

Exemplary UGT2B4 substrates (i.e., substrates of wild-type UGT2B4 and/or
UGT2B4v polypeptides) include, but are not necessarily limited to, estriol and the catechol
estrogens 4-hydroxyestrone and 2-hydroxyestriol, 2-aminophenol, 4-methylumbelliferone,
1-naphthol, 4-hydroxybiphenyl, 4-nitrophenol and menthol, among other substrates
(Burchell et al. (1991) DNA Cell Biol 10:487-494; Jin CJ, et al. (1993) Biochem Biophys
Res Commun 194:496-503).

Exemplary UGT2B7 substrates (i.e., substrates of wild-type UGT2B7 and/or
UGT2B7v polypeptides) include, but are not necessarily limited to, oxazepam, hyodeoxycholic
acid, estriol, S-naproxen, ketoprofen, ibuprofen, fenoprofen, clofibric acid (Patel et al. (1995)
Pharmacogenetics 5(1):43-49), morphine (Coffman et al. (1997) Drug Metabolism and
Disposition 25:1-4), DMXAA (5,6-dimethylxanthenone-4-acetic acid) (Miners et al. (1997)
Cancer Res 57:284), 2-Hydroxy AAF, 4-methylumbelliferone, carboxylic acid drugs
(BP-7,8-trans diol) (Burchell et al., supra).

Exemplary UGT2B15 substrates (i.e., substrates of wild-type UGT2B15 and/or
UGT2B15v polypeptides) include, but are not necessarily limited to, 4-hydroxybiphenyl,
1-naphthol, 4-methylumbelliferone, naringenin, eugenol (Burchell et al., supra), simple
phenolic compounds, 7-hydroxylated coumarins, flavonoids, anthraquinones; endogenous
estrogens and androgens (Green et al. (1994) Drug Metabolism and Disposition 22:799).



Modifier. A modifier is a chemical agent that modulates the action of a UGT2B 
molecule, either through altering its enzymatic activity (enzymatic modifier) or through
modulation of expression (expression modifier, e.g., by affecting transcription or translation).
In some cases the modifier may also be a substrate. 



Pharmacokinetic parameters. Pharmacokinetic parameters provide fundamental 
data for designing safe and effective dosage regimens. A drug's volume of distribution, 
clearance, and the derived parameter, half-life, are particularly important, as they determine
the degree of fluctuation between a maximum and minimum plasma concentration during 
a dosage interval, the magnitude of steady state concentration and the time to reach steady 
state plasma concentration upon chronic dosing. Parameters derived from in vivo drug 
administration are useful in determining the clinical effect of a particular UGT2B genotype. 
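
The relationship between these parameters can be made concrete with a worked sketch. It assumes a one-compartment model with first-order elimination and repeated intravenous bolus dosing, which is a standard textbook simplification and is not stated in this specification. Under that assumption the half-life is ln(2) x Vd / CL, the average steady-state concentration is F x dose / (CL x dosing interval), and the peak-to-trough ratio within an interval is exp(k x interval) with k = CL / Vd. All function names and numeric values are hypothetical.

```python
import math


def half_life(vd_l: float, cl_l_per_h: float) -> float:
    """Elimination half-life (h) from volume of distribution (L) and clearance (L/h)."""
    return math.log(2) * vd_l / cl_l_per_h


def average_steady_state_conc(dose_mg: float, interval_h: float,
                              cl_l_per_h: float, bioavailability: float = 1.0) -> float:
    """Average steady-state plasma concentration (mg/L) on chronic dosing."""
    return bioavailability * dose_mg / (cl_l_per_h * interval_h)


def peak_to_trough_ratio(vd_l: float, cl_l_per_h: float, interval_h: float) -> float:
    """Fluctuation between maximum and minimum concentration within one interval."""
    return math.exp(cl_l_per_h / vd_l * interval_h)


def fraction_of_steady_state(t_h: float, vd_l: float, cl_l_per_h: float) -> float:
    """Fraction of the steady-state level reached after t_h hours of dosing."""
    return 1.0 - math.exp(-cl_l_per_h / vd_l * t_h)


# Hypothetical drug: Vd = 70 L, CL = 7 L/h, 100 mg every 8 h.
t_half = half_life(70.0, 7.0)                          # about 6.9 h
c_avg = average_steady_state_conc(100.0, 8.0, 7.0)     # about 1.8 mg/L
```

Under this model, a genotype that halves clearance doubles both the half-life and the average steady-state concentration for the same regimen, which is one way the clinical effect of a UGT2B genotype could show up in these parameters.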

Expression assay. An assay to determine the effect of a sequence polymorphism 
on UGT2B expression. Expression assays may be performed in cell-free extracts, or by 
transforming cells with a suitable vector. Alterations in expression may occur in the basal 
level that is expressed in one or more cell types, or in the effect that an expression modifier 
has on the ability of the gene to be inhibited or induced. Expression levels of variant
alleles are compared by various methods known in the art. Methods for determining
promoter or enhancer strength include quantitation of the expressed natural protein; insertion
of the variant control element into a vector with a reporter gene such as β-galactosidase,
luciferase, chloramphenicol acetyltransferase, etc. that provides for convenient quantitation; 
and the like. 

Gel shift or electrophoretic mobility shift assay provides a simple and rapid method 
for detecting DNA-binding proteins (Ausubel, F.M. et al (1989) In: Current Protocols in 
Molecular Biology, Vol. 2, John Wiley and Sons, New York). This method has been used 
widely in the study of sequence-specific DNA-binding proteins, such as transcription factors. 
The assay is based on the observation that complexes of protein and DNA migrate through 
a nondenaturing polyacrylamide gel more slowly than free DNA fragments or
double-stranded oligonucleotides. The gel shift assay is performed by incubating a purified 
protein, or a complex mixture of proteins (such as nuclear or cell extract preparations), with 
an end-labeled DNA fragment containing the putative protein binding site. The reaction 
products are then analyzed on a nondenaturing polyacrylamide gel. The specificity of the 
DNA-binding protein for the putative binding site is established by competition experiments 
using DNA fragments or oligonucleotides containing a binding site for the protein of interest, 
or other unrelated DNA sequences. 

Expression assays can be used to detect differences in expression of polymorphisms
with respect to tissue specificity, expression level, or expression in response to exposure to 
various substrates, and/or timing of expression during development. For example, since 
UGT2B4 is expressed in liver, polymorphisms could be evaluated for expression in tissues 
other than liver, or expression in liver tissue relative to a reference UGT2B4 polypeptide. 

Substrate screening assay. Substrate screening assays are used to determine the 
metabolic activity of a UGT2B protein or peptide fragment on a substrate. Many suitable 
assays are known in the art, including the use of primary or cultured cells, genetically 
modified cells (e.g., where DNA encoding the UGT2B polymorphism to be studied is 
introduced into the cell within an artificial construct), cell-free systems, e.g. microsomal 
preparations or recombinant produced enzymes in a suitable buffer, or in animals, including 
human clinical trials (see, e.g., Burchell et al. (1995) Life Sci. 57:1819-1831, specifically
incorporated herein by reference). Where genetically modified cells are used, since most cell
lines do not express UGT2B activity (liver cell lines being the exception), introduction of
an artificial construct for expression of the UGT2B polymorphism into many human and
non-human cell lines does not require additional modification of the host to inactivate
endogenous UGT2B expression/activity. Clinical trials may monitor serum, urine, etc. levels 
of the substrate or its metabolite(s). 

Typically a candidate substrate is input into the assay system, and the conversion 
to a metabolite is measured over time. The choice of detection system is determined by the
substrate and the specific assay parameters. Assays are conventionally run, and will include 
negative and positive controls, varying concentrations of substrate and enzyme, etc. 
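
As a rough illustration of measuring conversion to a metabolite over time, the Python sketch below estimates a rate from a time course by a least-squares slope and compares a variant enzyme with a reference. This is not the assay described above: the function names, the linear-rate assumption and the example readings are all hypothetical.

```python
def initial_rate(times_min, metabolite_um):
    """Least-squares slope of metabolite concentration versus time (µM/min).

    Assumes the readings fall within the linear phase of the reaction.
    """
    n = len(times_min)
    if n < 2 or n != len(metabolite_um):
        raise ValueError("need at least two paired time/concentration readings")
    mean_t = sum(times_min) / n
    mean_c = sum(metabolite_um) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    sxy = sum((t - mean_t) * (c - mean_c) for t, c in zip(times_min, metabolite_um))
    return sxy / sxx


def relative_activity(variant_rate, reference_rate):
    """Activity of a variant enzyme as a fraction of the reference enzyme."""
    return variant_rate / reference_rate


if __name__ == "__main__":
    times = [0, 5, 10, 15]                # minutes (hypothetical)
    reference = [0.0, 1.0, 2.0, 3.0]      # µM metabolite, reference enzyme (hypothetical)
    variant = [0.0, 0.5, 1.0, 1.5]        # µM metabolite, variant enzyme (hypothetical)
    print(relative_activity(initial_rate(times, variant), initial_rate(times, reference)))
```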

Genotyping: UGT2B genotyping is performed by DNA or RNA sequence and/or 
hybridization analysis of any convenient sample from a patient, e.g. biopsy material, blood 
sample (serum, plasma, etc.), buccal cell sample, etc. A nucleic acid sample from an 
individual is analyzed for the presence of polymorphisms in UGT2B, particularly those that 
affect the activity or expression of UGT2B. Specific sequences of interest include any 
polymorphism that leads to changes in basal expression in one or more tissues, to changes 
in the modulation of UGT2B expression by modifiers, or alterations in UGT2B substrate 
specificity and/or activity. 

Linkage Analysis: Diagnostic screening may be performed for polymorphisms that
are genetically linked to a phenotypic variant in UGT2B activity or expression, particularly
through the use of microsatellite markers or SNPs. The microsatellite marker or SNP itself
may not be phenotypically expressed, but is linked to sequences that result in altered activity
or expression. Two polymorphic variants may be in linkage disequilibrium, i.e. where alleles
show non-random associations between genes even though individual loci are in
Hardy-Weinberg equilibrium.
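
The two population-genetic quantities named above can be illustrated with a short Python sketch. It uses the standard definitions of Hardy-Weinberg genotype frequencies and of the disequilibrium coefficient D and its normalized form r²; the function names and the example frequencies are hypothetical and are not taken from the data in this specification.

```python
def hardy_weinberg(p):
    """Expected genotype frequencies (AA, Aa, aa) for allele frequency p of A."""
    q = 1.0 - p
    return p * p, 2.0 * p * q, q * q


def linkage_disequilibrium(p_ab, p_a, p_b):
    """Return (D, r_squared) for allele A at one locus and allele B at another.

    p_ab is the frequency of haplotypes carrying both A and B;
    p_a and p_b are the individual allele frequencies.
    """
    d = p_ab - p_a * p_b
    denominator = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denominator == 0:
        raise ValueError("both loci must be polymorphic")
    return d, d * d / denominator


if __name__ == "__main__":
    # Random association: the haplotype frequency equals the product of allele frequencies.
    print(linkage_disequilibrium(0.25, 0.5, 0.5))
    # Complete association: allele A is always found together with allele B.
    print(linkage_disequilibrium(0.5, 0.5, 0.5))
```

A marker with r² close to 1 relative to a functional variant can stand in for that variant in screening, whereas a value near 0 indicates that the marker carries little information about it.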

Linkage analysis may be performed alone, or in combination with direct detection of 
phenotypically evident polymorphisms. The use of microsatellite markers for genotyping is 
well documented. For examples, see Mansfield et al. (1994) Genomics 24:225-233; and
Ziegle et al. (1992) Genomics 14:1026-1031. The use of SNPs for genotyping is illustrated
in Underhill et al. (1996) Proc Natl Acad Sci U S A 93:196-200.

Transgenic animals. The subject nucleic acids can be used to generate genetically 
modified non-human animals or site specific gene modifications in cell lines. The term 
"transgenic" is intended to encompass genetically modified animals having a deletion or
other knock-out of UGT2B4, UGT2B7 or UGT2B15 activity, having an exogenous UGT2B4,
UGT2B7 or UGT2B15 gene that is stably transmitted in the host cells, or having an 
exogenous UGT2B promoter operably linked to a reporter gene. Transgenic animals may 
be made through homologous recombination, where the UGT2B locus is altered. 
Alternatively, a nucleic acid construct is randomly integrated into the genome. Vectors for 
stable integration include plasmids, retroviruses and other animal viruses, YACs, and the 
like. Of interest are transgenic mammals, e.g. cows, pigs, goats, horses, etc., and 
particularly rodents, e.g. rats, mice, etc. 

Genetically Modified Cells. Primary or cloned cells and cell lines are modified by the
introduction of vectors comprising UGT2B4, UGT2B7 and UGT2B15 genetic polymorphisms.
The gene may comprise one or more variant sequences, preferably a haplotype of
commonly occurring combinations. In one embodiment of the invention, a panel of two or
more genetically modified cell lines, each cell line comprising a UGT2B polymorphism, is
provided for substrate and/or expression assays. The panel may further comprise cells 
genetically modified with other genetic sequences, including polymorphisms, particularly 
other sequences of interest for pharmacogenetic screening, e.g. UGT1, other UGT2 
sequences, cytochrome oxidase polymorphisms, etc. 

Vectors useful for introduction of the gene include plasmids and viral vectors, e.g.
retroviral-based vectors, adenovirus vectors, etc. that are maintained transiently or stably
in mammalian cells. A wide variety of vectors can be employed for transfection and/or
integration of the gene into the genome of the cells. Alternatively, micro-injection, fusion,
or the like may be employed for introduction of genes into a suitable host cell.

Genotyping Methods

The effect of a polymorphism in the UGT2B4, UGT2B7 or UGT2B15 gene sequence 
on the response to a particular substrate or modifier is determined by in vitro or in vivo 
assays. Such assays may include monitoring the metabolism of a substrate during clinical 
trials to determine the UGT2B enzymatic activity, specificity or expression level. Generally, 
in vitro assays are useful in determining the direct effect of a particular polymorphism, while 
clinical studies will also detect an enzyme phenotype that is genetically linked to a 
polymorphism. 

The response of an individual to the substrate or modifier can then be predicted by 
determining the UGT2B genotype, with respect to the polymorphism. Where there is a
differential distribution of a polymorphism by racial background, guidelines for drug 
administration can be generally tailored to a particular ethnic group. 

The basal expression level in different tissue may be determined by analysis of tissue 
samples from individuals typed for the presence or absence of a specific polymorphism. Any 
convenient method may be used, e.g. ELISA, RIA, etc. for protein quantitation, northern blot
or other hybridization analysis, quantitative RT-PCR, etc. for mRNA quantitation. The tissue 
specific expression is correlated with the genotype. 

The alteration of UGT2B expression in response to a modifier is determined by 
administering or combining the candidate modifier with an expression system, e.g. animal, 
cell, in vitro transcription assay, etc. The effect of the modifier on UGT2B transcription 
and/or steady state mRNA levels is determined. As with the basal expression levels, tissue 
specific interactions are of interest. Correlations are made between the ability of an 
expression modifier to affect UGT2B activity, and the presence of the provided 
polymorphisms. A panel of different modifiers, cell types, etc. may be screened in order to 
determine the effect under a number of different conditions. 

A UGT2B polymorphism that results in altered enzyme activity or specificity is 
determined by a variety of assays known in the art. The enzyme may be tested for 
metabolism of a substrate in vitro, for example in defined buffer, or in cell or subcellular 
lysates, where the ability of a substrate to be metabolized by UGT2B4, UGT2B7 or
UGT2B15 under physiologic conditions is determined. Where there are not significant issues
of toxicity from the substrate or metabolite(s), in vivo human trials may be utilized, as
previously described. 

The genotype of an individual is determined with respect to the provided UGT2B4, 
UGT2B7 and UGT2B15 polymorphisms. The genotype is useful for determining the 
presence of a phenotypically evident polymorphism, and for determining the linkage of a
polymorphism to phenotypic change. 

A number of methods are available for analyzing nucleic acids for the presence of 
a specific sequence. Where large amounts of DNA are available, genomic DNA is used 
directly. Alternatively, the region of interest is cloned into a suitable vector and grown in 
sufficient quantity for analysis. The nucleic acid may be amplified by conventional 
techniques, such as the polymerase chain reaction (PCR), to provide sufficient amounts for 
analysis. The use of the polymerase chain reaction is described in Saiki et al. (1985)
Science 230:1350-1354, and a review of current techniques may be found in Sambrook et
al., Molecular Cloning: A Laboratory Manual, CSH Press 1989, pp. 14.2-14.33. Amplification
may be used to determine whether a polymorphism is present, by using a primer that is
specific for the polymorphism. Alternatively, various methods are known in the art that utilize
oligonucleotide ligation as a means of detecting polymorphisms; for examples, see Riley et
al. (1990) Nucleic Acids Res 18:2887-2890; and Delahunty et al. (1996) Am J Hum Genet
58:1239-1246.

A detectable label may be included in an amplification reaction. Suitable labels 
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM),
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX),
6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or
N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); radioactive labels, e.g. 32P, 35S, 3H; etc.
The label may be a
two stage system, where the amplified DNA is conjugated to biotin, haptens, etc. having a 
high affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner 
is conjugated to a detectable label. The label may be conjugated to one or both of the 
primers. Alternatively, the pool of nucleotides used in the amplification is labeled, so as to 
incorporate the label into the amplification product. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or
other methods. Hybridization with the variant sequence may also be used to determine its
presence, by Southern blots, dot blots, etc. The hybridization pattern of a control and variant 
sequence to an array of oligonucleotide probes immobilized on a solid support, as described 
in U.S. 5,445,934, or in WO95/35505, may also be used as a means of detecting the 
presence of variant sequences. Single strand conformational polymorphism (SSCP) 
analysis, denaturing gradient gel electrophoresis (DGGE), mismatch cleavage detection, and 
heteroduplex analysis in gel matrices are used to detect conformational changes created by 
DNA sequence variation as alterations in electrophoretic mobility. Alternatively, where a 
polymorphism creates or destroys a recognition site for a restriction endonuclease 
(restriction fragment length polymorphism, RFLP), the sample is digested with that 
endonuclease, and the products size fractionated to determine whether the fragment was 
digested. Fractionation is performed by gel or capillary electrophoresis, particularly 
acrylamide or agarose gels. 
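
The restriction-site test described above can be sketched in Python. The example is a simplified illustration: it scans only the given strand (adequate for a palindromic site), the sequences are hypothetical toy sequences rather than the polymorphisms of this specification, and EcoRI (recognition site GAATTC, cutting after the first G) is used purely as a familiar example enzyme.

```python
def restriction_site_change(ref, alt, site):
    """Report whether the variant creates or destroys a recognition site."""
    ref_has = site.lower() in ref.lower()
    alt_has = site.lower() in alt.lower()
    if ref_has and not alt_has:
        return "destroyed"
    if alt_has and not ref_has:
        return "created"
    return "unchanged"


def fragment_lengths(seq, site, cut_offset):
    """Fragment sizes after cutting a linear sequence at every occurrence of site.

    cut_offset is the number of bases from the start of the site to the cut.
    """
    seq, site = seq.lower(), site.lower()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    reference = "ccgggaattcaaatt"   # hypothetical amplicon containing GAATTC
    variant = "ccgggagttcaaatt"     # hypothetical A-to-G change inside the site
    print(restriction_site_change(reference, variant, "GAATTC"))
    print(fragment_lengths(reference, "GAATTC", 1))
    print(fragment_lengths(variant, "GAATTC", 1))
```

The differing fragment sizes are what size fractionation by gel or capillary electrophoresis would reveal.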

In one embodiment of the invention, an array of oligonucleotides is provided, where
discrete positions on the array are complementary to one or more of the provided
polymorphic sequences, e.g. oligonucleotides of at least 12 nt, frequently 20 nt, or larger,
and including the sequence flanking the polymorphic position. Such an array may comprise
a series of oligonucleotides, each of which can specifically hybridize to a different
polymorphism. For examples of arrays, see Hacia et al. (1996) Nat Genet 14:441-447 and
DeRisi et al. (1996) Nat Genet 14:457-460.
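
One way to picture such probes is the Python sketch below, which builds one oligonucleotide per allele with the polymorphic base flanked on both sides. It is a hypothetical illustration, not the array design of the invention: the function names and the choice of 9 flanking bases are assumptions, and the context sequence is SEQ ID NO:25 from Table 4 written in lower case.

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (lower-case a, c, g, t only)."""
    return seq.lower().translate(COMPLEMENT)[::-1]


def allele_probes(context, position, alleles, flank=9):
    """Return {allele: probe}, each probe centered on the polymorphic base.

    context  -- sequence surrounding the polymorphic site
    position -- 0-based index of the polymorphic base within context
    flank    -- number of flanking bases kept on each side (assumed value)
    """
    context = context.lower()
    if position - flank < 0 or position + flank >= len(context):
        raise ValueError("not enough flanking sequence for the requested probe")
    left = context[position - flank:position]
    right = context[position + 1:position + 1 + flank]
    return {allele: left + allele.lower() + right for allele in alleles}


if __name__ == "__main__":
    seq_25 = "tggatgaatataaagacaatcctggat"
    # Index 14 is the base at which SEQ ID NOs:25 and 26 differ (g versus c).
    probes = allele_probes(seq_25, position=14, alleles=("g", "c"), flank=9)
    for allele, probe in probes.items():
        print(allele, probe, reverse_complement(probe))
```

With 9 flanking bases each probe is 19 nt long, which falls between the minimum of 12 nt and the more typical 20 nt mentioned above.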

The genotype information is used to predict the response of the individual to a 
particular UGT2B substrate or modifier. Where an expression modifier inhibits UGT2B 
expression, then drugs that are a UGT2B substrate will be metabolized more slowly if the 
modifier is co-administered. Where an expression modifier induces UGT2B expression, a 
co-administered substrate will typically be metabolized more rapidly. Similarly, changes in 
UGT2B activity will affect the metabolism of an administered drug. The pharmacokinetic 
effect of the interaction will depend on the metabolite that is produced, e.g. a prodrug is 
metabolized to an active form, a drug is metabolized to an inactive form, an environmental 
compound is metabolized to a toxin, etc. Consideration is given to the route of 
administration, drug-drug interactions, drug dosage, etc. 

Examples 

The following examples are put forth so as to provide those of ordinary skill in the art
with a complete disclosure and description of how to make and use the subject invention,
and are not intended to limit the scope of what is regarded as the invention. Efforts have
been made to ensure accuracy with respect to the numbers used (e.g., amounts,
temperature, concentrations, etc.) but some experimental errors and deviations should be 
allowed for. Unless otherwise indicated, parts are parts by weight, molecular weight is 
average molecular weight, temperature is in degrees centigrade; and pressure is at or near 
atmospheric. 

Example 1 
Genotyping UGT2B4 

Materials and Methods 

DNA Samples. Blood specimens from approximately 48 individuals were collected 
after obtaining informed consent. All samples were stripped of personal identifiers to 
maintain confidentiality. The only data associated with a given blood sample were gender
and self-reported major racial group designations in the United States (Caucasian, Hispanic, 
African American). Genomic DNA was isolated from these samples using standard 
techniques. DNA was stored either as a concentrated solution, or in a dried form in 
microtiter plates. 

PCR amplifications. The primers used to amplify exons in which polymorphisms 
were found are shown in Table 1, and were designed with NBI's Oligo version 5.1 program.
Sequences for exons in which no polymorphisms were found are not shown. 

Table 1. UGT2B4 PCR Primers.
Primary PCR Amplification

Region          Forward/Reverse   SEQ ID NO   Sequence
UGT2B4 Exon 1   F                 9           Taccttttagttgtctctttgtca
                R                 10          Ttcctggagtcttctgtatga
UGT2B4 Exon 4   F                 11          Catcccttgttcttctcatt
                R                 12          Cgggactggaaaataaatat
UGT2B4 Exon 6   F                 13          Ggggtttcaccgtgtta
                R                 14          Aaagccaagcagcactaa

Twenty-five nanograms of gDNA were amplified in the primary amplifications using
the Perkin Elmer GeneAmp PCR kit according to the manufacturer's instructions in 25 µl
reactions with AmpliTaq Gold DNA polymerase. Reactions contained 25 mM MgCl2 and 0.2
µM of each primer. Thermal cycling was performed using a GeneAmp PCR System 9600
PCR machine (Perkin Elmer), utilizing a touch-down PCR protocol. The protocol, unless
indicated otherwise in Table 2, consisted of an initial incubation of 95°C for 10 min, followed
by ten cycles of 95°C for 20 sec, 64°C (minus 1°C per cycle) for 20 sec, 72°C for 2 min, six
cycles of 95°C for 20 sec, 54°C for 20 sec, 72°C for 2 min, and nineteen cycles of 95°C for
20 sec, 54°C for 20 sec, 72°C for 2 min (plus 15 sec per cycle), and one final extension step
of 72°C for 10 min.
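
The default touch-down profile can be written out cycle by cycle, as in the Python sketch below. It is a hedged reading of the protocol text, not instrument software: it assumes that the first touch-down cycle anneals at 64°C and that the first of the nineteen extended cycles already carries the extra 15 sec. The per-exon modifications of Table 2 are not modelled, and the names `Step` and `touchdown_profile` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float  # block temperature in degrees C
    seconds: int   # hold time in seconds


INITIAL_DENATURATION = Step(95, 600)  # 95 C for 10 min
FINAL_EXTENSION = Step(72, 600)       # 72 C for 10 min


def touchdown_profile():
    """Return the cycles of the default profile as [denature, anneal, extend] lists."""
    cycles = []
    # Ten touch-down cycles: annealing starts at 64 C and drops 1 C per cycle
    # (assumption: the first cycle anneals at 64 C, so the tenth anneals at 55 C).
    for i in range(10):
        cycles.append([Step(95, 20), Step(64 - i, 20), Step(72, 120)])
    # Six cycles at the fixed 54 C annealing temperature.
    for _ in range(6):
        cycles.append([Step(95, 20), Step(54, 20), Step(72, 120)])
    # Nineteen cycles whose extension grows by 15 s per cycle
    # (assumption: the first of these already includes the extra 15 s).
    for i in range(19):
        cycles.append([Step(95, 20), Step(54, 20), Step(72, 120 + 15 * (i + 1))])
    return cycles


if __name__ == "__main__":
    profile = touchdown_profile()
    print(len(profile), "cycles")
    for number, (denature, anneal, extend) in enumerate(profile, start=1):
        print(number, anneal.temp_c, extend.seconds)
```

Under this reading the three blocks add up to 35 cycles, matching the total given in Table 2, and the last touch-down cycle (55°C) leads directly into the fixed 54°C annealing step.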

For the secondary PCR reactions, one microliter of each primary PCR reaction was 
re-amplified using the primary PCR primers. The thermal cycling profile that was used for 
the primary PCR for an exon was also used for the secondary PCR. 

Table 2.
Cycling Profile Modifications

Exon   Primary PCR                                Secondary PCR
1      Touch-Down PCR step: 8 cycles              same as Primary PCR
       64°C (minus 1°C per cycle), for 15 sec
       Total number of cycles: 35
4      Touch-Down PCR step: 10 cycles             same as Primary PCR
       64°C (minus 1°C per cycle), for 15 sec
       Total number of cycles: 35
6      Touch-Down PCR step: 7 cycles              same as Primary PCR
       64°C (minus 1°C per cycle), for 15 sec
       Total number of cycles: 35

DNA sequencing. PCR products from 48 individuals (approximately 1/3 African
American, 1/3 Caucasian, 1/3 Hispanic) were prepared for sequencing by treating 8 µl of
each PCR product with 0.15 µl of exonuclease I (1.5 U/reaction), 0.3 µl of Shrimp Alkaline
Phosphatase (0.3 U/reaction), q.s. to 10 µl with MilliQ water, and incubated at 37°C for 30
min, followed by 72°C for 15 min. Cycle sequencing was performed on the GeneAmp PCR
System 9600 PCR machine (Perkin Elmer) using the ABI Prism dRhodamine Terminator
Cycle Sequencing Ready Reaction Kit according to the manufacturer's directions, with the
following changes: (1) 2 µl of dRhodamine terminator premix, instead of 8 µl, (2) 10% (v/v)
dimethylsulfoxide was added to each individual nucleotide. The oligonucleotide primers
(unlabelled), at 3 picomoles per reaction, used for the sequencing reactions are listed in
Table 3. Sequencing reactions, with a final volume of 5 µl, were subjected to 25 cycles at
96°C for 10 sec, 50°C for 5 sec, and 60°C for 4 min, followed by ethanol precipitation. After
decanting the ethanol, samples were evaporated to dryness using a SpeedVac for roughly
15 min and were resuspended in 2 µl of loading buffer (5:1 deionized formamide:50 mM
EDTA pH 8.0), heated to 94°C for 2 min, and were electrophoresed through 5.25%
polyacrylamide/6M urea gels in an ABI Prism 377 DNA Sequencer, according to the
manufacturer's instructions for sequence determination. All sequences were determined
from both the 5' and 3' (sense and antisense) direction.
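
The volumes in the clean-up step imply a small amount of arithmetic, sketched here in Python. The helper names are hypothetical, and the enzyme stock concentrations are only inferred from the stated volumes and units per reaction; they are not given in the text.

```python
from decimal import Decimal


def water_to_final(final_ul, *component_ul):
    """Volume of water (µl) needed to bring the listed components to final_ul."""
    water = Decimal(final_ul) - sum(Decimal(volume) for volume in component_ul)
    if water < 0:
        raise ValueError("components exceed the final volume")
    return water


def implied_stock_units_per_ul(units_per_reaction, volume_ul):
    """Enzyme stock concentration (U/µl) implied by units and volume per reaction."""
    return Decimal(units_per_reaction) / Decimal(volume_ul)


if __name__ == "__main__":
    # 8 µl PCR product + 0.15 µl exonuclease I + 0.3 µl phosphatase, q.s. to 10 µl.
    print(water_to_final("10", "8", "0.15", "0.3"))
    # 1.5 U delivered in 0.15 µl; 0.3 U delivered in 0.3 µl.
    print(implied_stock_units_per_ul("1.5", "0.15"))
    print(implied_stock_units_per_ul("0.3", "0.3"))
```

Decimal arithmetic is used so that sub-microliter volumes add up exactly rather than accumulating binary floating-point error.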



Table 3. Sequencing Primers

P. No.   F/R   SEQ ID NO   Primer Sequence
1        F     15          Ccacatgctcagactgttaa
         R     16          Caaaaataccccactaccc
2        F     17          Cccttgttcttctcattgtta
         R     18          Ttcagtaagcttgtttcatgat
3,4      F     19          Cctggccaaattgactt
         R     20          Caggaacccagtcacatc
5        F     21          Ggggaaaagagattaattacg
         R     22          Agccaagcagcactaatc
6,7      F     23          Tccaattcacaggttacatg
         R     24          Agccaagcagcactaatc


Table 4. Summary of UGT2B4 polymorphisms.

Exon     Nt change   AA change     SEQ ID   Sequence
1        G 157 C     Lys 40 Asn    25       Tggatgaatataaagacaatcctggat
                                   26       Tggatgaatataaacacaatcctggat
Int. 4   T 61 C                    27       Aagtgttaatagttatcatgaaacaag
                                   28       Aagtgttaatagctatcatgaaacaag
6        T 1411 A    Glu 454 Asp   29       Tgaagccccttgatcgagcagtcttct
                                   30       Tgaagccccttgaacgagcagtcttct
6        C 1412 A                  31
                                   32       Tgaagccccttgatagagcagtcttct
6        T 1849 C                           Gatataaagccatacgaggttatattg
                                   34       Gatataaagccatatgaggttatattg
6        A 1919 C                  35       Caggttacatgaaaaaaaatttacta
                                   36       Caggttacatgaaaaacaatttacta
6        A 2072 G                  37       Ttgttgaggaagctaataaataattaa
                                   38       Ttgttgaggaaactaataaataattaa


Nucleotide variants in exons are numbered from the first base in Sequence 1. Amino acid
changes are numbered beginning with the first methionine in the protein sequence provided 
in Sequence 1. The nucleotide variant in intron 4 is numbered from the beginning of intron 
4, as provided in Sequence 2.4. 
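
The paired sequences listed for each polymorphism can be compared base by base to locate the variant position within the listed fragment. The Python sketch below is an illustrative helper only: the function name is hypothetical, it handles substitutions between equal-length sequences and not deletions, and the two sequences are SEQ ID NOs:25 and 26 written in lower case.

```python
def variant_positions(ref, alt):
    """List (index, ref_base, alt_base) for every mismatch between two aligned,
    equal-length sequences. Indices are 0-based within the fragment."""
    if len(ref) != len(alt):
        raise ValueError("sequences must have equal length (substitutions only)")
    return [
        (index, r, a)
        for index, (r, a) in enumerate(zip(ref.lower(), alt.lower()))
        if r != a
    ]


SEQ_25 = "tggatgaatataaagacaatcctggat"
SEQ_26 = "tggatgaatataaacacaatcctggat"

if __name__ == "__main__":
    for index, ref_base, alt_base in variant_positions(SEQ_25, SEQ_26):
        print(index, ref_base, alt_base)
```

The index reported here is relative to the fragment as listed, not to the numbering of Sequence 1 used in the "Nt change" column.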

Example 2
UGT2B7 Genotyping

Twenty-five nanograms of gDNA were amplified in the primary amplifications using
the Perkin Elmer GeneAmp PCR kit according to the manufacturer's instructions in 25 µl
reactions with AmpliTaq Gold DNA polymerase. Reactions contained 25 mM MgCl2 and 0.2
µM of each primer. Thermal cycling was performed using a GeneAmp PCR System 9600
PCR machine (Perkin Elmer), utilizing a touch-down PCR protocol. The exons for UGT2B7
were amplified using the following cycling conditions: an initial incubation at 96°C for 10
min, followed by 16 cycles of 95°C for 20 sec, 52°C for 20 sec, 72°C for 2 min, and
nineteen cycles of 95°C for 20 sec, 52°C for 20 sec, 72°C for 2 min (plus 15 sec per cycle),
and one final extension step of 72°C for 10 min.

For the secondary PCR reactions, one microliter of each primary PCR reaction was
re-amplified using the primary PCR primers. The thermal cycling profile that was used for
the primary PCR for an exon was also used for the secondary PCR.

The amplification primers are provided in Table 5, the sequencing primers in Table
6, and the polymorphisms in Table 7.



Table 5
PCR Primers

Region          Primer        SEQ ID NO   Primer Sequence
UGT2B7 Exon 1   Primary F     46          Cttggctaatttatctttgg
                Primary R     47          Cccactaccctgactttat
                Secondary F   48          Ggacataaccatgagaaatg
                Secondary R   49          Agctctgcttcaaagacac
UGT2B7 Exon 2   Primary F     50          Tgtccgtatgctactattgaa
                Primary R     51          Tgtgctaatccctttgtaaat
                Secondary F   52          Tttttttttctattcctgtcag
                Secondary R   53          Ctttaccccacccattt
UGT2B7 Exon 4   Primary F     54          Cccttgatctcattcctact
                Primary R     55          Aactggctattctttagatgtatg
                Secondary F   56          Cattcctactctttatacagttctc
                Secondary R   57          Cccccgattcagactat
UGT2B7 Exon 5   Primary F     58          Cccttgatctcattcctact
                Primary R     59          Aactggctattctttagatgtatg
                Secondary F   60          Tcctccgaagtctgaaac
                Secondary R   61          Tataaaaaggatgaaactcacac
UGT2B7 Exon 6   Primary F     62          Caagcccccaagttatgt
                Primary R     63          Cagtaggatccgcgatataa
                Secondary F   64          Tctgaggggttttgtctgta
                Secondary R   65          Ccgcgatataagttcaacaa



DNA sequencing. PCR products from 48 individuals were prepared for sequencing
by treating 8 µL of each PCR product with 0.15 µL of exonuclease I (1.5 U/reaction), 0.3 µL
of Shrimp Alkaline Phosphatase (0.3 U/reaction), q.s. to 10 µL with MilliQ water, and
incubated at 37°C for 30 min, followed by 72°C for 15 min. Cycle sequencing was performed
on the GeneAmp PCR System 9600 PCR machine (Perkin Elmer) using the ABI Prism
dRhodamine Terminator Cycle Sequencing Ready Reaction Kit or the ABI Prism Big Dye
Terminator Cycle Sequencing Ready Reaction Kit according to the manufacturer's directions,
with the following changes: For the ABI Prism dRhodamine Terminator kit, (1) 2 µL of
dRhodamine terminator premix, instead of 8 µL, (2) 10% (v/v) dimethylsulfoxide was added
to each individual nucleotide, (3) 5 µL total volume instead of 20 µL. For the ABI Prism Big
Dye Terminator kit, (1) 0.8 µL of Big Dye terminator premix, instead of 8 µL, and (2) 15 µL
total volume instead of 20 µL. The oligonucleotide primers (unlabeled), at 3 picomoles per
reaction, used for the sequencing reactions are listed in Table 6. Sequencing reactions, with
a final volume of 5 µL, were subjected to 25 cycles at 96°C for 10 sec, 50°C for 5 sec, and
60°C for 4 min, followed by ethanol precipitation. After decanting the ethanol, samples were
evaporated to dryness using a SpeedVac for roughly 15 min and were resuspended in 2 µL
of loading buffer (5:1 deionized formamide:50 mM EDTA pH 8.0), heated to 94°C for 2 min,
and were electrophoresed through 5.25% polyacrylamide/6M urea gels in an ABI Prism 377
DNA Sequencer, according to the manufacturer's instructions for sequence determination.
All sequences were determined from both the 5' and 3' (sense and antisense) direction.


Table 6

P. No.   F/R   SEQ ID NO   Primer Sequence
1,2      F     66          Ggacataaccatgagaaatg
         R     67          Ttaagagcggatgagttgt
3,4      F     68          Tcatcatgcaacagattaag
         R     69          Cactacagggaaaaatagca
5        F     70          Accctttgtgtacagtctca
         R     71          Agctctgcttcaaagacac
6,7      F     72          Ttgcctacattattctaaccc
         R     73          Ctttaccccacccattt
8,9      F     74          Cattcctactctttatacagttctc
         R     75          Cccccgattcagactat
10       F     76          Cattcctactctttatacagttctc
         R     77          Cccccgattcagactat
11,12    F     78          Tcctccgaagtctgaaac
         R     79          Tataaaaaggatgaaactcacac
13       F     80          Tctgaggggttttgtctgta
         R     81          Ttttttgtctcaggaagaaaga
14       F     82          Aaaaaaagaaaaaaaaatcttttc
         R     83          Ccgcgatataagttcaacaa


Table 7

N    Exon    Nt change     AA change      SEQ ID NO   Sequence
1    1       G13A                         84          Tgcattgcaccaggatgtctgt
                                          85          Tgcattgcaccaagatgtctgt
2            T151C         Leu 107 Phe    86          Tcctggatgagcttattcagaga
                                          87          Tcctggatgagcctattcagaga
3    1       A236T                        88          Cattttggttatatttttcac
                                          89          Cattttggttttatttttcac
4            A286G                        90          Cataactagaaagttctgtaa
                                          91          Cataactaggaagttctgtaa
5    1       C450T         Thr 179 Ile    92          Cctggctacacttttgaaaa
                                          93          Cctggctacatttttgaaaa
6    2       A14G                         94          Gaagacccactacattatctg
                                          95          Gaagacccactacgttatctg
7    2       AT 80-81 TC                  96          Aattttcagtttccatatccactctt
                                          97          Aattttcagtttcctcatccactctt
8    4       C57G                         98          Taggtctcaatactcggctcta
                                          99          Taggtctcaatactcggctgta
9    4       C60T                         100         Tacaagtggataccccaga
                                          101         Tataagtggataccccaga
10   In. 4   A 154 del                    102         Gggagaaagaatacattataattttt
                                          103         Gggagaaagaatacttataattttt
11   5       C101T                        104         Ttccattgtttgccgatcaac
                                          105         Ttccattgtttgctgatcaac
12   5       A198C         Lys 430 Gln    106         Gaatgcattgaagagagtaat
                                          107         Gaatgcattgcagagagtaat
13   6       A197G                        108         Ctggtctgtgtggcaactgtga
                                          109         Ctggtctgtgtggcgactgtga
14   6       C528A                        110         Taagataaagccttatgag
                                          111         Taagataaagacttatgag


Example 3
Genotyping UGT2B15

Sequencing and analysis were performed as described in Example 2. The
amplification primers are provided in Table 9, the sequencing primers in Table 8, and the
polymorphisms in Table 10.

Table 8
Sequencing Primers UGT2B15

Region           Primer        SEQ ID NO   Primer Sequence
UGT2B15 Exon 1   Primary F     119         catgcacctattcagactgt
                 Primary R     120         tgggtgtcctgtagtagtga
                 Secondary F   121         attgatttttcctcagatataagta
                 Secondary R   122         tcataatttcccttaaaaacac
UGT2B15 Exon 2   Primary F     123         atatgtttgggtatgttattcc
                 Primary R     124         ccatattcccctcactct
                 Secondary F   125         atacctgcatattcaaataacaa
                 Secondary R   126         tatccagccattccttct
UGT2B15 Exon 5   Primary F     127         agttttgtgggtataatgttac
                 Primary R     128         aaacgggttaaaattcata
                 Secondary F   129         tcataccttgtaattaataattttg
                 Secondary R   130         cgggttaaaattcatattca
UGT2B15 Exon 6   Primary F     131         tcatgccaattcagtgac
                 Primary R     132         accctccatgctgaaat
                 Secondary F   133         tcaaagaccatccatagactt
                 Secondary R   134         ggagtcccatctttcagtc



Table 9
PCR Primers UGT2B15

P. No.   F/R   SEQ ID NO   Primer Sequence
1,2      F     135         Attgatttttcctcagatataagta
         R     136         Atttactggcattgacaag
3        F     137         Attgatttttcctcagatataagta
         R     138         Tgtacagaaagggtatgttaaa
4        F     139         aaaaat g/t atttggaagattc
         R     140         Tcataatttcccttaaaaacac
5        F     141         Atacctgcatattcaaataacaa
         R     142         Tatccagccattccttct
6,7      F     143         Tcataccttgtaattaataattttg
         R     144         Cgggttaaaattcatattca
8,9      F     145         Tcaaagaccatccatagactt
         R     146         Ggagtcccatctttcagtc


Table 10
Summary of Sequence Polymorphisms UGT2B15

Exon   Ntd change   AA change     SEQ ID NO.   Sequence
       A53G         Ser 15 Gly    147          tgatacagctcagttgttac
                                  148          tgatacagctcggttgttac
       T184G                      149          tgttgacatcttcggcttct
                                  150          tgttgacatcgtcggcttct
       G263T        Asp 85 Tyr    151          ctttaactaaaaatgatttggaa
                                  152          ctttaactaaaaattatttggaa
       T519C        Leu 170 Pro   153          tttaacataccctttctgtaca
                                  154          tttaacataccctttccgtaca
       C122G        His 282 Gln   155          ttggaggacttcactgtaaacc
                                  156          ttggaggacttcagtgtaaacc
       G59A                       157          tatgaggcgatctaccatgggat
                                  158          tatgaggcaatctaccatgggat
       C100T        Ala 398 Val   159          cccttgtttgcggatcaacatgat
                                  160          cccttgtttgtggatcaacatgat
       G14A         Val 443 Ile   161          aaagagaatgtcatgaaattat
                                  162          aaagagaatatcatgaaattat
       C523A        Thr 523 Lys   163          gcttgccaaaacaggaaagaa
                                  164          gcttgccaaaaaaggaaagaa


All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be readily apparent 
to those of ordinary skill in the art in light of the teachings of this invention that certain 
changes and modifications may be made thereto without departing from the spirit or scope 
of the appended claims. 
What is Claimed is:

1. An isolated nucleic acid molecule comprising a UGT2B sequence
polymorphism of SEQ ID NOs:25-38; 84-111 or 147-164, as part of other than a naturally
occurring chromosome.

2. A nucleic acid probe for detection of UGT2B locus polymorphisms, comprising
a polymorphic sequence of SEQ ID NOs:25-38; 84-111 or 147-164.

3. A nucleic acid probe according to Claim 2, wherein said probe is conjugated
to a detectable marker.

4. An array of oligonucleotides comprising:

two or more probes for detection of UGT2B locus polymorphisms, said probes
comprising at least one form of a polymorphic sequence of SEQ ID NOs:25-38; 84-111 or
147-164.

5. A method for detecting in an individual a polymorphism in a UGT2B
metabolism of a substrate, the method comprising:

analyzing the genome of said individual for the presence of at least one UGT2B
polymorphism of SEQ ID NOs:25-38; 84-111 or 147-164; wherein the presence of said
predisposing polymorphism is indicative of an alteration in UGT2B expression or activity.

6. A method according to Claim 5, wherein said analyzing step comprises
detection of specific binding between the genomic DNA of said individual and an array of
oligonucleotides comprising:

two or more probes for detection of UGT2B locus polymorphisms, said probes
comprising at least one form of a polymorphic sequence of SEQ ID NOs:25-38; 84-111 or
147-164.

7. A method according to Claim 5, wherein said alteration in UGT2B expression
is tissue specific.

8. A method according to Claim 5, wherein said alteration in UGT2B expression
is in response to a UGT2B modifier.

9. A method according to Claim 8, wherein said modifier induces UGT2B
expression.

10. A method according to Claim 8, wherein said modifier inhibits UGT2B
expression.
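
The probe-based analysis recited in Claims 5 and 6 relies on hybridization in the laboratory. Purely as an in-silico analogy, and assuming exact string matching stands in for specific binding, the sketch below checks which form of a polymorphic probe pair occurs in a sample sequence. The function name `call_alleles` and the choice of sample excerpt are illustrative assumptions, not part of the claimed method.

```python
def call_alleles(sample: str, probes: dict[str, str]) -> set[str]:
    """Return the names of probes whose sequence occurs exactly in the sample."""
    text = sample.replace(" ", "").lower()
    return {
        name
        for name, probe in probes.items()
        if probe.replace(" ", "").lower() in text
    }


# Probe pair taken from SEQ ID NOs: 25 and 26 of the sequence listing.
probes = {
    "SEQ ID NO:25": "tggatgaata taaagacaat cctggat",
    "SEQ ID NO:26": "tggatgaata taaacacaat cctggat",
}

# Sample: nucleotides 281-320 of SEQ ID NO:1 as printed in the listing.
sample = "ctggatgaat ataaagacaa tcctggatga acttgtccag"

print(call_alleles(sample, probes))  # {'SEQ ID NO:25'}
```

Exact matching is far stricter than hybridization, so this only illustrates the bookkeeping of assigning a sample to one form of a polymorphism.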
SEQUENCE LISTING

<110> Penny, Laura
      Galvin, Margaret
      Miller, Andrew
      Reidy, Michael

<120> Genotyping Human UDP-Glucuronosyltransferase 2B4 (UGT2B4), 2B7 (UGT2B7) and
      2B15 (UGT2B15) Genes

<130> SEQ-22PRV2 

<160> 164 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1323 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 

<222> (140)...(897)

<400> 1 

tcatctacct tttagttgtc tctttgtcat ccacatgctc agactgttaa tataatgtat 60 

ttactttgaa gtgtaaaagt tacattttaa cttcttgact gatttatact ggatgtcacc 120 

atgagaaatg acagaaagga gcagcaactg gaaaacaagc attgcattgc atcaggatgt 180 

ctatgaaatg gacttcagct cttctgctga tacagctgag ctgttacttt agctctggga 240 

gttgtggaaa ggtgctggtg tggcccacag aattcagcca ctggatgaat ataaagacaa 300 

tcctggatga acttgtccag agaggtcatg aggtgactgt attggcatct tcagcttcca 360 

tttctttcga tcccaacagc ccatctactc ttaaatttga agtttatcct gtatctttaa 420 

ctaaaactga gtttgaggat attatcaagc agctggttaa gagatgggca gaacttccaa 480 

aagacacatt ttggtcatat ttttcacaag tacaagaaat catgtggaca tttaatgaca 540 

tacttagaaa gttctgtaag gatatagttt caaataagaa acttatgaag aaactacagg 600 

agtcaagatt tgatgttgtt cttgcagatg ctgttttccc ctttggtgag ctgctggccg 660 

agttacttaa aatacccttt gtctacagcc tccgcttctc tcctggctac gcaattgaaa 720 

agcatagtgg aggacttctg ttccctcctt cctatgtgcc tgttgttatg tcagaactaa 780 

gtgaccaaat gactttcata gagagggtaa aaaatatgat ctatgtgctt tattttgaat 840 

tttggttcca aatatttgac atgaagaagt gggatcagtt ctacagtgaa gttctaggta 900 

agtaactttt ttgattggta acatgaagat ctaactttct tgtacctttg aagctgagtt 960 

tgtataaagc cataaagtca gggtagtggg gtatttttgt aatgaattta tcaaatgaaa 1020 

ttgtaagatg atctaccaaa ctcacaagca ctatagaaaa tgtaaattat aggatcagtt 1080 

aaaactctgt ggccatcact catacagaag actccaggaa gtcataagcc tgtatattag 1140 

tgcacctaag atttctttaa gcaatcacat atctgtttta ttatacattt tttcatctta 1200 

aaaaaaagtc agacttattc agaaacatct tgctgaatgc atactggtag attgagtagt 1260 

tacacatttt ttagaactat ctatataaca ttgcagaaat tgttttttct tgtatatttg 1320 

cag 1323

<210> 2 
<211> 746 
<212> DNA 
<213> H. sapiens

<220>

<221> Other

<222> (195)...(344)

<400> 2

ttcttgtaaa tacacatggg taaaatatat aatacataaa aattaaatta tgcctatata 60 

cgaatatatg tatttttttt caaggcacaa acactttgcc tacatttttg cccacattat 120 

tctaacttct ttcagaaaat tacctagttt aattatcttg tgtcatctat cttttctttt 180 

tttttccccc atcaggaaga cccactacgt tatctgagac aatggcaaaa gctgacatat 240 

ggcttattcg aaactactgg gattttcaat ttcctcaccc actcttacca aatgttgagt 300 

tcgttggagg actccactgc aaacctgcca aacccctacc gaaggtaaac tattactgtt 360 

tgttttgtct gctttgaagt ttcagtacga atggttctat attcattcaa agtgtttgac 420 

ttacactgga agaaaggtgg aagtgggaag agtaaagcag ataccaatta gaaactgacg 480

tacatgttga tactatcaca agtttatgaa tttcatcatt attaccaata aagagggata 540 

ctaaagagac tttgaaaata gggttggtaa attaaagctt tgattatgca acatataaga 600 

aggtactggc cattcattca aagaatattt ataaagagat tagcacacac cacaggtacg 660 

tgtatgggac acagtttcta tcccaacaca ccttacattc tattttgaaa gatagaatat 720 

atgcaagtaa taaaaactgt gtaaaa 746 

<210> 3 
<211> 785
<212> DNA
<213> H. sapiens

<220>

<221> Other

<222> (238)...(369)

<400> 3 

ttcacaaacg cacacacata cacacacaca tatttacaca aagaccctta acagaggcaa 60 

cctatctcat attatacata ttgcaaaaaa aactgagtaa ttgagtcagt taaaaaacat 120 

cctttactcc aataattcct gataaaactt gattttctct ctttttataa caattctttc 180 

acagtgcttg ctgtgctgat aatctattat gatagaacaa attctttttt ttcacaggaa 240 

atggaagagt ttgtccagag ctctggagaa aatggtgttg tggtgttttc tctggggtcg 300 

atggtcagta acacgtcaga agaaagggcc aatgtaattg catcagccct tgccaagatc 360 

ccacaaaagg taagataaaa tgttttaatg gtgtaaaaaa ctactgaaag aggctgttaa 420 

agtttgtaaa gaacccaatt gtagaaactt cctgcctata tattcagctg ttgggaaagc 480 

actaattatc tcagatatta attcaaaatc aaaaatatgt atggaagatg ataaactcat 540 

acagaaggtg tttttcattg gtaattaatt tggcattaat attgtgatca ggaataaata 600 

caattaagag ttgcaggtaa agttttggta ttatcatgat actggggtca ggtaagagct 660 

atcaccaaat tctgcccctg tgatttgatc cttttgttta agaactcctg agggcgatgt 720 

acatcctaca ggtgttagaa aacgttacat tttaatgagt aacttcacta gcacaataac 780 

aatag 785

<210> 4 
<211> 1138 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 

<222> (395)...(482)

<400> 4 

catctgttat tttttgagtt tttaataatg gccattctga ctggtgtgag atggtatctc 60

tttgtggatt taaccagtga tgtaaacctt tttttcatat agtggtttgc cacatatagt 120 

tttcttttga aaagtgtaac aactttttaa atacttgaac ttttcattga ttatcttatt 180 

tgtctaagct actattttga aaaatcatga tttccttata tacctaatta tgaaattaag 240 

gaaatgaaat atgagtattc tatttacatc agtctgagta gttcttgtta cttaacatcc 300 

cttgttcttc tcattgttaa tctctttaga tttctaacat tctatgactt ttgagttcca 360 

ctcatggaat aagatatttt cttcactgta acaggttctg tggagatttg atgggaataa 420 

accagatact ttaggactca atactcggct gtacaagtgg ataccccaga atgatcttct 480 

tggtaagtct ctgaagaaca aatactgaat atattagtaa cagattatta aagtgttaat 540 
agctatcatg aaacaagctt actgaacatt tgttatggaa aaacttaaaa ataaaatgaa 600

acttctttat atttattttc cagtcccggg ggaaaagaat aaattgttgg cattttatga 660 

tatgcaccca cattctttac aatcagagtc agagtatctt tatttcaggt gttattacct 720 

cccacagaat ttttctggca cttcctgggt tgtcttcctt tctcatattt ctacaacttt 780 

acacctgttc tttcctcctc tgtagggtta tttcaaatgt cactaaaagt aacagctctt 840 

ctgctatcac cagggatgct gcattttctg taggattaaa tccctaatct taatcaaaaa 900 

gtgatgacac atttcataat gaaatgtgac ctgtctttcc tcaattctag caccaccacc 960 

acctcactgc ctgctgcctt gcacacccta catatccaac tccgtgactg tacttaagag 1020 

aacacattct ggctgggcac ggtgctcacg cctgcaatcc tagcactttg ggaggctgat 1080 

ggcaggtgga ttgactgagc tcaggagttc aagaccatcc tgggcaacat ggtgaaac 1138 

<210> 5 
<211> 689 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 

<222> (123)...(342)

<400> 5 

aaaaacaatt ttaattcagt tcagtgtgtt atctaggaaa caccgtcaca ttcagattct 60 

tccattgtgc atttctcatt ttattcctat gaataatttt gctaaaattc atccaatcct 120 

aggtcaccca aaaaccagag cttttataac tcatggtgga gccaatggca tctatgaggc 180 

aatctaccat ggaatcccta tggtgggcgt tccattgttt gcagatcaac ctgataacat 240 

tgcacacatg aaggccaagg gagcagctgt tagtttggac ttccacacaa tgtcgagtac 300 

agacttactc aatgcactga agacagtaat taatgatcct ttgtgagtat aacttttttt 360 

ttactcggtg gtctttatag ataggttccc ttgtgaatag tgagtatgac ttttatcctt 420 

tttataagcg actgatttcg aaagaattta agtgatttaa acaatctgaa atctgctttt 480 

atttttgagt ggttatttaa aaattttatt tgaaccacat acatttaatg aataatcaat 540 

tattgaaata attttctaca caaaaataat tttaaagtga tatagataag aagacatttt 600 

aaaataaatt tgacgtaatc aatccacagt agaaaggaaa gataaacttg acgtaatata 660 

ataaaatatt ttaattcaat atctaaaat 689

<210> 6 
<211> 1589 
<212> DNA 
<213> H. sapiens 

<220> 

<221> Other 

<222> (731)...(1475)

<400> 6 

atgcttaagc aatgggtagc ctttcttcat gatgtgatta tttcacactg cagcctgtat 60 

caaaacatct catgcacctc atagaaaaat acccctacta tgtaaccaca aaaactaaaa 120 

attaaaagaa aataaaattg ctcatatgtt ctctgcctca aataattaac tttctcacct 180 

gaccctccat ttttacttta aaaatatttg tcaattatga aattccaatt taaaagccaa 240 

actttctatg atgactcaaa ttaraaataca cacattctat gtcaattcta tgacatttac 300

tttgaatgat ctggcacttt aaaaaccttt cgtggacttg atgtgctcag gcaaattaac 360 

ttaccttctc tttttttgag agggaagtct cactctgtca ccaggctgga gtgcagtggt 420 

gtgattgtgg ctcactgcaa cttccgcctc ttgggttcaa gcgattctcc tgcctcagcc 480 

tctcaagtag ctgggactac aggcacatgc caccacgcct gggtaatctt tttttttttt 540 

ttttttttca tatttttact ggagacgggg tgaacggggt ttcaccgtgt tagccaggat 600 

ggtcttgatc tcctgacctc gtgatccgcc cgcctcgacc tcggaaagtg ctgggattgc 660 

aggtgtgagc ctccgtgcct ggccaaattg acttactttc aatgttgata cttttctgct 720 

tatcgtttag atataaagag aatgctatga aattatcaag aattcatcat gatcaaccag 780 

tgaagcccct tgatcgagca gtcttctgga ttgaatttgt catgcgccat aaaggagcca 840 

agcaccttcg ggttgcagcc cacgacctca cctggttcca gtaccactct ttggatgtga 900 

ctgggttcct gctggcctgt gtggcaactg tgatattcat catcacaaaa tgtctgtttt 960 

gtgtctggaa gtttgttaga acaggaaaga aggggaaaag agattaatta cgtctgaggc 1020 

tggaagctgg gaaacccaat aaatgaactc ctttagttta ttacaacaag aagacgttgt 1080 
gatacaagag attcctttct tcttgtgaca aaacatcttt caaaacttac cttgtcaagt 1140

caaaatttgt tttagtacct gtttaaccat tagaaatatt tcatgtcaag gaggaaaaca 1200 

ttagggaaaa caaaaatgat ataaagccat acgaggttat attgaaatgt attgagctta 1260 

tattgaaatt tattgttcca attcacaggt tacatgaaaa aaaatttact aagcttaact 1320 

acatgtcaca cattgtacat ggaaacaaga acattaagaa gtccactgac agtatcagta 1380 

ctgttttgca aatactcagc atactttgga tccatttcat gcaggattgt gttgttttaa 1440 

ctgttgttga ggaaactaat aaataattaa attgtataga aagtctcttc ctcttgatat 1500 

tttgagatga ttagtgctgc ttggctttta ttgtgcatcg tgcttcaacg tcattttttt 1560 

tcctaaaagg tatgataaaa aatgcttac 1589

<210> 7 
<211> 2092 
<212> DNA 
<213> H. sapiens 



<220> 
<221> CDS 

<222> (38)... (1621) 



<400> 7 

agcagcaact ggaaaacaag cattgcattg catcagg atg tct atg aaa tgg act  55
                                         Met Ser Met Lys Trp Thr
                                           1               5

tca gct ctt ctg ctg ata cag ctg agc tgt tac ttt agc tct ggg agt  103
Ser Ala Leu Leu Leu Ile Gln Leu Ser Cys Tyr Phe Ser Ser Gly Ser
             10                  15                  20

tgt gga aag gtg ctg gtg tgg ccc aca gaa ttc agc cac tgg atg aat  151
Cys Gly Lys Val Leu Val Trp Pro Thr Glu Phe Ser His Trp Met Asn
         25                  30                  35

ata aag aca atc ctg gat gaa ctt gtc cag aga ggt cat gag gtg act  199
Ile Lys Thr Ile Leu Asp Glu Leu Val Gln Arg Gly His Glu Val Thr
     40                  45                  50

gta ttg gca tct tca gct tcc att tct ttc gat ccc aac agc cca tct  247
Val Leu Ala Ser Ser Ala Ser Ile Ser Phe Asp Pro Asn Ser Pro Ser
 55                  60                  65                  70

act ctt aaa ttt gaa gtt tat cct gta tct tta act aaa act gag ttt  295
Thr Leu Lys Phe Glu Val Tyr Pro Val Ser Leu Thr Lys Thr Glu Phe
                 75                  80                  85

gag gat att atc aag cag ctg gtt aag aga tgg gca gaa ctt cca aaa  343
Glu Asp Ile Ile Lys Gln Leu Val Lys Arg Trp Ala Glu Leu Pro Lys
             90                  95                 100

gac aca ttt tgg tca tat ttt tca caa gta caa gaa atc atg tgg aca  391
Asp Thr Phe Trp Ser Tyr Phe Ser Gln Val Gln Glu Ile Met Trp Thr
        105                 110                 115

ttt aat gac ata ctt aga aag ttc tgt aag gat ata gtt tca aat aag  439
Phe Asn Asp Ile Leu Arg Lys Phe Cys Lys Asp Ile Val Ser Asn Lys
    120                 125                 130

aaa ctt atg aag aaa cta cag gag tca aga ttt gat gtt gtt ctt gca  487
Lys Leu Met Lys Lys Leu Gln Glu Ser Arg Phe Asp Val Val Leu Ala
135                 140                 145                 150

gat gct gtt ttc ccc ttt ggt gag ctg ctg gcc gag tta ctt aaa ata  535
Asp Ala Val Phe Pro Phe Gly Glu Leu Leu Ala Glu Leu Leu Lys Ile
                155                 160                 165

ccc ttt gtc tac agc ctc cgc ttc tct cct ggc tac gca att gaa aag  583
Pro Phe Val Tyr Ser Leu Arg Phe Ser Pro Gly Tyr Ala Ile Glu Lys
            170                 175                 180

cat agt gga gga ctt ctg ttc cct cct tcc tat gtg cct gtt gtt atg  631
His Ser Gly Gly Leu Leu Phe Pro Pro Ser Tyr Val Pro Val Val Met
        185                 190                 195

tca gaa cta agt gac caa atg act ttc ata gag agg gta aaa aat atg  679
Ser Glu Leu Ser Asp Gln Met Thr Phe Ile Glu Arg Val Lys Asn Met
    200                 205                 210

atc tat gtg ctt tat ttt gaa ttt tgg ttc caa ata ttt gac atg aag  727
Ile Tyr Val Leu Tyr Phe Glu Phe Trp Phe Gln Ile Phe Asp Met Lys
215                 220                 225                 230

aag tgg gat cag ttc tac agt gaa gtt cta gga aga ccc act acg tta  775
Lys Trp Asp Gln Phe Tyr Ser Glu Val Leu Gly Arg Pro Thr Thr Leu
                235                 240                 245

tct gag aca atg gca aaa gct gac ata tgg ctt att cga aac tac tgg  823
Ser Glu Thr Met Ala Lys Ala Asp Ile Trp Leu Ile Arg Asn Tyr Trp
            250                 255                 260

gat ttt caa ttt cct cac cca ctc tta cca aat gtt gag ttc gtt gga  871
Asp Phe Gln Phe Pro His Pro Leu Leu Pro Asn Val Glu Phe Val Gly
        265                 270                 275

gga ctc cac tgc aaa cct gcc aaa ccc cta ccg aag gaa atg gaa gag  919
Gly Leu His Cys Lys Pro Ala Lys Pro Leu Pro Lys Glu Met Glu Glu
    280                 285                 290

ttt gtc cag agc tct gga gaa aat ggt gtt gtg gtg ttt tct ctg ggg  967
Phe Val Gln Ser Ser Gly Glu Asn Gly Val Val Val Phe Ser Leu Gly
295                 300                 305                 310

tcg atg gtc agt aac acg tca gaa gaa agg gcc aat gta att gca tca  1015
Ser Met Val Ser Asn Thr Ser Glu Glu Arg Ala Asn Val Ile Ala Ser
                315                 320                 325

gcc ctt gcc aag atc cca caa aag gtt ctg tgg aga ttt gat ggg aat  1063
Ala Leu Ala Lys Ile Pro Gln Lys Val Leu Trp Arg Phe Asp Gly Asn
            330                 335                 340

aaa cca gat act tta gga ctc aat act cgg ctg tac aag tgg ata ccc  1111
Lys Pro Asp Thr Leu Gly Leu Asn Thr Arg Leu Tyr Lys Trp Ile Pro
        345                 350                 355

cag aat gat ctt ctt ggt cac cca aaa acc aga gct ttt ata act cat  1159
Gln Asn Asp Leu Leu Gly His Pro Lys Thr Arg Ala Phe Ile Thr His
    360                 365                 370

ggt gga gcc aat ggc atc tat gag gca atc tac cat gga atc cct atg  1207
Gly Gly Ala Asn Gly Ile Tyr Glu Ala Ile Tyr His Gly Ile Pro Met
375                 380                 385                 390

gtg ggc gtt cca ttg ttt gca gat caa cct gat aac att gca cac atg  1255
Val Gly Val Pro Leu Phe Ala Asp Gln Pro Asp Asn Ile Ala His Met
                395                 400                 405

aag gcc aag gga gca gct gtt agt ttg gac ttc cac aca atg tcg agt  1303
Lys Ala Lys Gly Ala Ala Val Ser Leu Asp Phe His Thr Met Ser Ser
            410                 415                 420

aca gac tta ctc aat gca ctg aag aca gta att aat gat cct tta tat  1351
Thr Asp Leu Leu Asn Ala Leu Lys Thr Val Ile Asn Asp Pro Leu Tyr
        425                 430                 435

aaa gag aat gct atg aaa tta tca aga att cat cat gat caa cca gtg  1399
Lys Glu Asn Ala Met Lys Leu Ser Arg Ile His His Asp Gln Pro Val
    440                 445                 450

aag ccc ctt gat cga gca gtc ttc tgg att gaa ttt gtc atg cgc cat  1447
Lys Pro Leu Asp Arg Ala Val Phe Trp Ile Glu Phe Val Met Arg His
455                 460                 465                 470

aaa gga gcc aag cac ctt cgg gtt gca gcc cac gac ctc acc tgg ttc  1495
Lys Gly Ala Lys His Leu Arg Val Ala Ala His Asp Leu Thr Trp Phe
                475                 480                 485

cag tac cac tct ttg gat gtg act ggg ttc ctg ctg gcc tgt gtg gca  1543
Gln Tyr His Ser Leu Asp Val Thr Gly Phe Leu Leu Ala Cys Val Ala
            490                 495                 500

act gtg ata ttc atc atc aca aaa tgt ctg ttt tgt gtc tgg aag ttt  1591
Thr Val Ile Phe Ile Ile Thr Lys Cys Leu Phe Cys Val Trp Lys Phe
        505                 510                 515

gtt aga aca gga aag aag ggg aaa aga gat taattacgtc tgaggctgga  1641
Val Arg Thr Gly Lys Lys Gly Lys Arg Asp
    520                 525

agctgggaaa cccaataaat gaactccttt agtttattac aacaagaaga cgttgtgata 1701

caagagattc ctttcttctt gtgacaaaac atctttcaaa acttaccttg tcaagtcaaa 1761

atttgtttta gtacctgttt aaccattaga aatatttcat gtcaaggagg aaaacattag 1821

ggaaaacaaa aatgatataa agccatacga ggttatattg aaatgtattg agcttatatt 1881

gaaatttatt gttccaattc acaggttaca tgaaaaaaaa tttactaagc ttaactacat 1941

gtcacacatt gtacatggaa acaagaacat taagaagtcc actgacagta tcagtactgt 2001

tttgcaaata ctcagcatac tttggatcca tttcatgcag gattgtgttg ttttaactgt 2061

tgttgaggaa actaataaat aattaaattg t 2092
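
The amino acid lines printed under the coding region of SEQ ID NO:7 follow from the standard genetic code. As a minimal sketch (the helper name `translate` is hypothetical, and only the standard code is assumed), the following Python translates the first codons of the coding region and derives the number of codons implied by the `<222> (38)...(1621)` feature:

```python
from itertools import product

BASES = "tcag"
# Standard genetic code, amino acids ordered by codon: ttt, ttc, tta, ttg, tct, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes."""
    seq = cds.replace(" ", "").lower()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First six codons of the SEQ ID NO:7 coding region (nucleotides 38-55).
print(translate("atg tct atg aaa tgg act"))  # MSMKWT

# Length implied by the <222> (38)...(1621) feature.
cds_length = 1621 - 38 + 1
print(cds_length, cds_length // 3)  # 1584 528
```

The 528 codons agree with the `<211> 528` length given for the protein of SEQ ID NO:8.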



<210> 8
<211> 528
<212> PRT
<213> H. sapiens

<400> 8
Met Ser Met Lys Trp Thr Ser Ala Leu Leu Leu Ile Gln Leu Ser Cys
  1               5                  10                  15

Tyr Phe Ser Ser Gly Ser Cys Gly Lys Val Leu Val Trp Pro Thr Glu
             20                  25                  30

Phe Ser His Trp Met Asn Ile Lys Thr Ile Leu Asp Glu Leu Val Gln
         35                  40                  45

Arg Gly His Glu Val Thr Val Leu Ala Ser Ser Ala Ser Ile Ser Phe
     50                  55                  60

Asp Pro Asn Ser Pro Ser Thr Leu Lys Phe Glu Val Tyr Pro Val Ser
 65                  70                  75                  80

Leu Thr Lys Thr Glu Phe Glu Asp Ile Ile Lys Gln Leu Val Lys Arg
                 85                  90                  95

Trp Ala Glu Leu Pro Lys Asp Thr Phe Trp Ser Tyr Phe Ser Gln Val
            100                 105                 110

Gln Glu Ile Met Trp Thr Phe Asn Asp Ile Leu Arg Lys Phe Cys Lys
        115                 120                 125

Asp Ile Val Ser Asn Lys Lys Leu Met Lys Lys Leu Gln Glu Ser Arg
    130                 135                 140

Phe Asp Val Val Leu Ala Asp Ala Val Phe Pro Phe Gly Glu Leu Leu
145                 150                 155                 160

Ala Glu Leu Leu Lys Ile Pro Phe Val Tyr Ser Leu Arg Phe Ser Pro
                165                 170                 175

Gly Tyr Ala Ile Glu Lys His Ser Gly Gly Leu Leu Phe Pro Pro Ser
            180                 185                 190

Tyr Val Pro Val Val Met Ser Glu Leu Ser Asp Gln Met Thr Phe Ile
        195                 200                 205

Glu Arg Val Lys Asn Met Ile Tyr Val Leu Tyr Phe Glu Phe Trp Phe
    210                 215                 220

Gln Ile Phe Asp Met Lys Lys Trp Asp Gln Phe Tyr Ser Glu Val Leu
225                 230                 235                 240

Gly Arg Pro Thr Thr Leu Ser Glu Thr Met Ala Lys Ala Asp Ile Trp
                245                 250                 255

Leu Ile Arg Asn Tyr Trp Asp Phe Gln Phe Pro His Pro Leu Leu Pro
            260                 265                 270

Asn Val Glu Phe Val Gly Gly Leu His Cys Lys Pro Ala Lys Pro Leu
        275                 280                 285

Pro Lys Glu Met Glu Glu Phe Val Gln Ser Ser Gly Glu Asn Gly Val
    290                 295                 300

Val Val Phe Ser Leu Gly Ser Met Val Ser Asn Thr Ser Glu Glu Arg
305                 310                 315                 320

Ala Asn Val Ile Ala Ser Ala Leu Ala Lys Ile Pro Gln Lys Val Leu
                325                 330                 335

Trp Arg Phe Asp Gly Asn Lys Pro Asp Thr Leu Gly Leu Asn Thr Arg
            340                 345                 350

Leu Tyr Lys Trp Ile Pro Gln Asn Asp Leu Leu Gly His Pro Lys Thr
        355                 360                 365

Arg Ala Phe Ile Thr His Gly Gly Ala Asn Gly Ile Tyr Glu Ala Ile
    370                 375                 380

Tyr His Gly Ile Pro Met Val Gly Val Pro Leu Phe Ala Asp Gln Pro
385                 390                 395                 400

Asp Asn Ile Ala His Met Lys Ala Lys Gly Ala Ala Val Ser Leu Asp
                405                 410                 415

Phe His Thr Met Ser Ser Thr Asp Leu Leu Asn Ala Leu Lys Thr Val
            420                 425                 430

Ile Asn Asp Pro Leu Tyr Lys Glu Asn Ala Met Lys Leu Ser Arg Ile
        435                 440                 445

His His Asp Gln Pro Val Lys Pro Leu Asp Arg Ala Val Phe Trp Ile
    450                 455                 460

Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu Arg Val Ala Ala
465                 470                 475                 480

His Asp Leu Thr Trp Phe Gln Tyr His Ser Leu Asp Val Thr Gly Phe
                485                 490                 495

Leu Leu Ala Cys Val Ala Thr Val Ile Phe Ile Ile Thr Lys Cys Leu
            500                 505                 510

Phe Cys Val Trp Lys Phe Val Arg Thr Gly Lys Lys Gly Lys Arg Asp
        515                 520                 525

<210> 9
<211> 24
<212> DNA
<213> H. sapiens

<400> 9
taccttttag ttgtctcttt gtca 24

<210> 10
<211> 21
<212> DNA
<213> H. sapiens

<400> 10
ttcctggagt cttctgtatg a 21

<210> 11
<211> 20
<212> DNA
<213> H. sapiens

<400> 11
catcccttgt tcttctcatt 20

<210> 12
<211> 20
<212> DNA
<213> H. sapiens

<400> 12
cgggactgga aaataaatat 20

<210> 13
<211> 17
<212> DNA
<213> H. sapiens

<400> 13
ggggtttcac cgtgtta 17

<210> 14
<211> 18
<212> DNA
<213> H. sapiens

<400> 14
aaagccaagc agcactaa 18

<210> 15
<211> 20
<212> DNA
<213> H. sapiens

<400> 15
ccacatgctc agactgttaa 20

<210> 16
<211> 19
<212> DNA
<213> H. sapiens

<400> 16
caaaaatacc ccactaccc 19

<210> 17
<211> 21
<212> DNA
<213> H. sapiens

<400> 17
cccttgttct tctcattgtt a 21
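
The short sequences in the listing can be checked against the longer genomic sequences. As a sketch only (the function names are hypothetical, and exact matching is assumed in place of any laboratory annealing conditions), the following Python locates SEQ ID NOs: 9 and 10 within excerpts of SEQ ID NO:1, searching both the printed strand and its reverse complement:

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def locate_primer(template: str, primer: str):
    """Return (strand, 0-based index) of the primer in the template, or None."""
    t = template.replace(" ", "").lower()
    p = primer.replace(" ", "").lower()
    idx = t.find(p)
    if idx != -1:
        return ("+", idx)
    idx = t.find(reverse_complement(p))
    if idx != -1:
        return ("-", idx)
    return None


# Excerpts of SEQ ID NO:1 as printed: nucleotides 1-60 and 1081-1140.
head = "tcatctacct tttagttgtc tctttgtcat ccacatgctc agactgttaa tataatgtat"
tail = "aaaactctgt ggccatcact catacagaag actccaggaa gtcataagcc tgtatattag"

print(locate_primer(head, "taccttttag ttgtctcttt gtca"))  # SEQ ID NO:9  -> ('+', 5)
print(locate_primer(tail, "ttcctggagt cttctgtatg a"))     # SEQ ID NO:10 -> ('-', 19)
```

The indices are relative to each excerpt; adding the excerpt's starting coordinate gives the position within SEQ ID NO:1.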



<210> 18
<211> 22
<212> DNA
<213> H. sapiens

<400> 18
ttcagtaagc ttgtttcatg at 22

<210> 19
<211> 17
<212> DNA
<213> H. sapiens

<400> 19
cctggccaaa ttgactt 17

<210> 20
<211> 18
<212> DNA
<213> H. sapiens

<400> 20
caggaaccca gtcacatc 18

<210> 21
<211> 21
<212> DNA
<213> H. sapiens

<400> 21
ggggaaaaga gattaattac g 21

<210> 22
<211> 18
<212> DNA
<213> H. sapiens

<400> 22
agccaagcag cactaatc 18

<210> 23
<211> 20
<212> DNA
<213> H. sapiens

<400> 23
tccaattcac aggttacatg 20

<210> 24
<211> 16
<212> DNA
<213> H. sapiens

<400> 24
agccaagcag cactaatc

<210> 25
<211> 27
<212> DNA
<213> H. sapiens

<400> 25
tggatgaata taaagacaat cctggat 27
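
Each entry declares its length under `<211>`, so a listing can be sanity-checked by counting the residues actually printed under `<400>`. The sketch below (the helper name `check_length` is hypothetical) applies that check to SEQ ID NOs: 22 and 24 as they appear above. Whether the mismatch it reports for SEQ ID NO:24 stems from the filing or from scanning cannot be determined from this text.

```python
def check_length(declared: int, sequence_line: str) -> bool:
    """Compare a <211> value with the number of residues actually printed."""
    # Keep letters only, dropping the spaces and any trailing position count.
    residues = "".join(ch for ch in sequence_line if ch.isalpha())
    return len(residues) == declared


print(check_length(18, "agccaagcag cactaatc 18"))  # SEQ ID NO:22 -> True
print(check_length(16, "agccaagcag cactaatc"))     # SEQ ID NO:24 as printed -> False
```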

<210> 26
<211> 27
<212> DNA
<213> H. sapiens

<400> 26
tggatgaata taaacacaat cctggat 27

<210> 27
<211> 27
<212> DNA
<213> H. sapiens

<400> 27
aagtgttaat agttatcatg aaacaag 27

<210> 28
<211> 27
<212> DNA
<213> H. sapiens

<400> 28
aagtgttaat agctatcatg aaacaag 27

<210> 29
<211> 27
<212> DNA
<213> H. sapiens

<400> 29
tgaagcccct tgatcgagca gtcttct 27

<210> 30
<211> 27
<212> DNA
<213> H. sapiens

<400> 30
tgaagcccct tgaacgagca gtcttct 27

<210> 31
<211> 27
<212> DNA
<213> H. sapiens

<400> 31
tgaagcccct tgatcgagca gtcttct 27

<210> 32
<211> 27
<212> DNA
<213> H. sapiens

<400> 32
tgaagcccct tgatagagca gtcttct 27

<210> 33
<211> 27
<212> DNA
<213> H. sapiens

<400> 33
gatataaagc catacgaggt tatattg 27

<210> 34
<211> 27
<212> DNA
<213> H. sapiens

<400> 34
gatataaagc catatgaggt tatattg 27

<210> 35
<211> 26
<212> DNA
<213> H. sapiens

<400> 35
caggttacat gaaaaaaaat ttacta 26

<210> 36
<211> 26
<212> DNA
<213> H. sapiens

<400> 36
caggttacat gaaaaacaat ttacta 26

<210> 37
<211> 27
<212> DNA
<213> H. sapiens

<400> 37
ttgttgagga agctaataaa taattaa 27

<210> 38
<211> 27
<212> DNA
<213> H. sapiens

<400> 38
ttgttgagga aactaataaa taattaa 27

<210> 39
<211> 1854
<212> DNA
<213> H. sapiens

<220>
<221> CDS
<222> (15)...(1584)

<400> 39

tgcattgcac cagg atg tct gtg aaa tgg act tca gta att ttg cta ata  50
                Met Ser Val Lys Trp Thr Ser Val Ile Leu Leu Ile
                  1               5                  10

caa ctg agc ttt tgc ttt agc tct ggg aat tgt gga aag gtg ctg gtg  98
Gln Leu Ser Phe Cys Phe Ser Ser Gly Asn Cys Gly Lys Val Leu Val
         15                  20                  25

tgg gca gca gaa tac agc cat tgg atg aat ata aag aca atc ctg gat  146
Trp Ala Ala Glu Tyr Ser His Trp Met Asn Ile Lys Thr Ile Leu Asp
     30                  35                  40

gag ctt att cag aga ggt cat gag gtg act gta ctg gca tct tca gct  194
Glu Leu Ile Gln Arg Gly His Glu Val Thr Val Leu Ala Ser Ser Ala
 45                  50                  55                  60

tcc att ctt ttt gat ccc aac aac tca tcc gct ctt aaa att gaa att  242
Ser Ile Leu Phe Asp Pro Asn Asn Ser Ser Ala Leu Lys Ile Glu Ile
                 65                  70                  75

tat ccc aca tct tta act aaa act gag ttg gag aat ttc atc atg caa  290
Tyr Pro Thr Ser Leu Thr Lys Thr Glu Leu Glu Asn Phe Ile Met Gln
             80                  85                  90

cag att aag aga tgg tca gac ctt cca aaa gat aca ttt tgg tta tat  338
Gln Ile Lys Arg Trp Ser Asp Leu Pro Lys Asp Thr Phe Trp Leu Tyr
         95                 100                 105

ttt tca caa gta cag gaa atc atg tca ata ttt ggt gac ata act aga  386
Phe Ser Gln Val Gln Glu Ile Met Ser Ile Phe Gly Asp Ile Thr Arg
    110                 115                 120

aag ttc tgt aaa gat gta gtt tca aat aag aaa ttt atg aaa aaa gta  434
Lys Phe Cys Lys Asp Val Val Ser Asn Lys Lys Phe Met Lys Lys Val
125                 130                 135                 140

caa gag tca aga ttt gac gtc att ttt gca gat gct att ttt ccc tgt  482
Gln Glu Ser Arg Phe Asp Val Ile Phe Ala Asp Ala Ile Phe Pro Cys
                145                 150                 155

agt gag ctg ctg gct gag cta ttt aac ata ccc ttt gtg tac agt ctc  530
Ser Glu Leu Leu Ala Glu Leu Phe Asn Ile Pro Phe Val Tyr Ser Leu
            160                 165                 170

agc ttc tct cct ggc tac act ttt gaa aag cat agt gga gga ttt att  578
Ser Phe Ser Pro Gly Tyr Thr Phe Glu Lys His Ser Gly Gly Phe Ile
        175                 180                 185

ttc cct cct tcc tac gta cct gtt gtt atg tca gaa tta act gat caa  626
Phe Pro Pro Ser Tyr Val Pro Val Val Met Ser Glu Leu Thr Asp Gln
    190                 195                 200

atg act ttc atg gag agg gta aaa aat atg atc tat gtg ctt tac ttt  674
Met Thr Phe Met Glu Arg Val Lys Asn Met Ile Tyr Val Leu Tyr Phe
205                 210                 215                 220

gac ttt tgg ttc gaa ata ttt gac atg aag aag tgg gat cag ttt tat  722
Asp Phe Trp Phe Glu Ile Phe Asp Met Lys Lys Trp Asp Gln Phe Tyr
                225                 230                 235

agt gaa gtt cta gga aga ccc act aca tta tct gag aca atg ggg aaa  770
Ser Glu Val Leu Gly Arg Pro Thr Thr Leu Ser Glu Thr Met Gly Lys
            240                 245                 250

gct gac gta tgg ctt att cga aac tcc tgg aat ttt cag ttt cca tat  818
Ala Asp Val Trp Leu Ile Arg Asn Ser Trp Asn Phe Gln Phe Pro Tyr
        255                 260                 265

cca ctc tta cca aat gtt gat ttt gtt gga gga ctc cac tgc aaa cct  866
Pro Leu Leu Pro Asn Val Asp Phe Val Gly Gly Leu His Cys Lys Pro
    270                 275                 280

gcc aaa ccc ctg cct aag gaa atg gaa gac ttt gta cag agc tct gga  914
Ala Lys Pro Leu Pro Lys Glu Met Glu Asp Phe Val Gln Ser Ser Gly
285                 290                 295                 300

aca gaa gaa agg gcc aac gta att gca tca gcc ctg gcc cag atc cca  1010
Thr Glu Glu Arg Ala Asn Val Ile Ala Ser Ala Leu Ala Gln Ile Pro
            320                 325                 330

caa aag gtt ctg tgg aga ttt gat ggg aat aaa cca gat acc tta ggt  1058
Gln Lys Val Leu Trp Arg Phe Asp Gly Asn Lys Pro Asp Thr Leu Gly
        335                 340                 345

ctc aat act cgg ctc tac aag tgg ata ccc cag aat gac ctt cta ggt  1106
Leu Asn Thr Arg Leu Tyr Lys Trp Ile Pro Gln Asn Asp Leu Leu Gly
    350                 355                 360

cat cca aag acc aga gct ttt ata act cat ggt gga gcc aat ggc atc  1154
His Pro Lys Thr Arg Ala Phe Ile Thr His Gly Gly Ala Asn Gly Ile
365                 370                 375                 380

tac gag gca atc tac cat ggg atc cct atg gtg ggg att cca ttg ttt  1202
Tyr Glu Ala Ile Tyr His Gly Ile Pro Met Val Gly Ile Pro Leu Phe
                385                 390                 395

gcc gat caa cct gat aac att gct cac atg aag gcc agg gga gca gct  1250
Ala Asp Gln Pro Asp Asn Ile Ala His Met Lys Ala Arg Gly Ala Ala
            400                 405                 410

gtt aga gtg gac ttc aac aca atg tcg agt aca gac ttg ctg aat gca  1298
Val Arg Val Asp Phe Asn Thr Met Ser Ser Thr Asp Leu Leu Asn Ala
        415                 420                 425

ttg aag aga gta att aat gat cct tca tat aaa gag aat gtt atg aaa  1346
Leu Lys Arg Val Ile Asn Asp Pro Ser Tyr Lys Glu Asn Val Met Lys
    430                 435                 440

tta tca aga att caa cat gat caa cca gtg aag ccc ctg gat cga gca  1394
Leu Ser Arg Ile Gln His Asp Gln Pro Val Lys Pro Leu Asp Arg Ala
445                 450                 455                 460

gtc ttc tgg att gaa ttt gtc atg cgc cac aaa gga gct aaa cac ctt  1442
Val Phe Trp Ile Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu
                465                 470                 475

cgg gtt gca gcc cac gac ctc acc tgg ttc cag tac cac tct ttg gat  1490
Arg Val Ala Ala His Asp Leu Thr Trp Phe Gln Tyr His Ser Leu Asp
            480                 485                 490

gtg att ggg ttc ctg ctg gtc tgt gtg gca act gtg ata ttt atc gtc  1538
Val Ile Gly Phe Leu Leu Val Cys Val Ala Thr Val Ile Phe Ile Val
        495                 500                 505

aca aaa tgt tgt ctg ttt tgt ttc tgg aag ttt gct aga aaa gca a  1584
Thr Lys Cys Cys Leu Phe Cys Phe Trp Lys Phe Ala Arg Lys Ala
    510                 515                 520

agaagggaaa aaatgattag ttatatctga gatttgaagc tggaaaacct gataggtgag 1644

actacttcag tttattccag caagaaagat tgtgatgcaa gatttctttc ttcctgagac 1704

aaaaaaaaaa aaagaaaaaa aaatcttttc aaaatttact ttgtcaaata aaaatttgtt 1764

tttcagagat ttaccaccca gttcatggtt agaaatattt tgtggcaatg aagaaaacac 1824

tacggaaaat aaaaaataag ataaagcctt 1854

<210> 40 
<211> 524 
<212> PRT 
<213> H. sapiens 



<400> 40 



Met Ser Val Lys Trp Thr Ser Val Ile Leu Leu Ile Gln Leu Ser Phe
  1               5                  10                  15

Cys Phe Ser Ser Gly Asn Cys Gly Lys Val Leu Val Trp Ala Ala Glu
             20                  25                  30

Tyr Ser His Trp Met Asn Ile Lys Thr Ile Leu Asp Glu Leu Ile Gln
         35                  40                  45

Arg Gly His Glu Val Thr Val Leu Ala Ser Ser Ala Ser Ile Leu Phe
     50                  55                  60

Asp Pro Asn Asn Ser Ser Ala Leu Lys Ile Glu Ile Tyr Pro Thr Ser
 65                  70                  75                  80

Leu Thr Lys Thr Glu Leu Glu Asn Phe Ile Met Gln Gln Ile Lys Arg
                 85                  90                  95

Trp Ser Asp Leu Pro Lys Asp Thr Phe Trp Leu Tyr Phe Ser Gln Val
            100                 105                 110

Gln Glu Ile Met Ser Ile Phe Gly Asp Ile Thr Arg Lys Phe Cys Lys
        115                 120                 125

Asp Val Val Ser Asn Lys Lys Phe Met Lys Lys Val Gln Glu Ser Arg
    130                 135                 140

Phe Asp Val Ile Phe Ala Asp Ala Ile Phe Pro Cys Ser Glu Leu Leu
145                 150                 155                 160

Ala Glu Leu Phe Asn Ile Pro Phe Val Tyr Ser Leu Ser Phe Ser Pro
                165                 170                 175

Gly Tyr Thr Phe Glu Lys His Ser Gly Gly Phe Ile Phe Pro Pro Ser
            180                 185                 190

Tyr Val Pro Val Val Met Ser Glu Leu Thr Asp Gln Met Thr Phe Met
        195                 200                 205

Glu Arg Val Lys Asn Met Ile Tyr Val Leu Tyr Phe Asp Phe Trp Phe
    210                 215                 220

Glu Ile Phe Asp Met Lys Lys Trp Asp Gln Phe Tyr Ser Glu Val Leu
225                 230                 235                 240

Gly Arg Pro Thr Thr Leu Ser Glu Thr Met Gly Lys Ala Asp Val Trp
                245                 250                 255

Leu Ile Arg Asn Ser Trp Asn Phe Gln Phe Pro Tyr Pro Leu Leu Pro
            260                 265                 270

Asn Val Asp Phe Val Gly Gly Leu His Cys Lys Pro Ala Lys Pro Leu
        275                 280                 285

Pro Lys Glu Met Glu Asp Phe Val Gln Ser Ser Gly Glu Asn Gly Val
    290                 295                 300

Val Val Phe Ser Leu Gly Ser Met Val Ser Asn Met Thr Glu Glu Arg
305                 310                 315                 320

Ala Asn Val Ile Ala Ser Ala Leu Ala Gln Ile Pro Gln Lys Val Leu
                325                 330                 335

Trp Arg Phe Asp Gly Asn Lys Pro Asp Thr Leu Gly Leu Asn Thr Arg
            340                 345                 350

Leu Tyr Lys Trp Ile Pro Gln Asn Asp Leu Leu Gly His Pro Lys Thr
        355                 360                 365

Arg Ala Phe Ile Thr His Gly Gly Ala Asn Gly Ile Tyr Glu Ala Ile
    370                 375                 380

Tyr His Gly Ile Pro Met Val Gly Ile Pro Leu Phe Ala Asp Gln Pro
385                 390                 395                 400

Asp Asn Ile Ala His Met Lys Ala Arg Gly Ala Ala Val Arg Val Asp
                405                 410                 415

Phe Asn Thr Met Ser Ser Thr Asp Leu Leu Asn Ala Leu Lys Arg Val
            420                 425                 430

Ile Asn Asp Pro Ser Tyr Lys Glu Asn Val Met Lys Leu Ser Arg Ile
        435                 440                 445

Gln His Asp Gln Pro Val Lys Pro Leu Asp Arg Ala Val Phe Trp Ile
    450                 455                 460

Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu Arg Val Ala Ala
465                 470                 475                 480

His Asp Leu Thr Trp Phe Gln Tyr His Ser Leu Asp Val Ile Gly Phe
                485                 490                 495




Leu 


Leu 


Val 


Cys 


Val Ala 


Thr Val 


lie 


Phe He 


Val 


Thr 


Lys 


Cys 


Cys 








500 






505 








510 


Leu 


Phe 


Cys 


Phe 


Trp Lys 


Phe Ala 


Arg 


Lys Ala 


Lys 














515 






520 















<210> 41 
<211> 1686 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (392)...(1126)

<400> 41 

tccccagttt cacaaaaata tgtggaccat gtttagtcat ttaatcttta gttttgtgtc 60 

aaatggactg cagaaacaag atctgtcact gctactgttc tggacactct tctaaaatat 120 

attgcataag acagatggca tgtccataca agatccttga tattagctga aggatagcac 180 

tcataaacat aaaagggaaa ttaatcacat ctgtgtgaac agatcattta ccttcatttg 240 

tctctttgcc atccacatgc tcagactgtt gatttaatga tattgtatgt actttgactt 300 

ataagggtta cattttaact tcttggctaa tttatctttg gacataacca tgagaaatga 360 

cagaaaggaa cagcaactgg aaaacaagca ttgcattgca ccaggatgtc tgtgaaatgg 420 

acttcagtaa ttttgctaat acaactgagc ttttgcttta gctctgggaa ttgtggaaag 480 

gtgctggtgt gggcagcaga atacagccat tggatgaata taaagacaat cctggatgag 540 

cttattcaga gaggtcatga ggtgactgta ctggcatctt cagcttccat tctttttgat 600 

cccaacaact catccgctct taaaattgaa atttatccca catctttaac taaaactgag 660 

ttggagaatt tcatcatgca acagattaag agatggtcag accttccaaa agatacattt 720 

tggttatatt tttcacaagt acaggaaatc atgtcaatat ttggtgacat aactagaaag 780 

ttctgtaaag atgtagtttc aaataagaaa tttatgaaaa aagtacaaga gtcaagattt 840 

gacgtcattt ttgcagatgc tatttttccc tgtagtgagc tgctggctga gctatttaac 900 

ataccctttg tgtacagtct cagcttctct cctggctaca cttttgaaaa gcatagtgga 960 

ggatttattt tccctccttc ctacgtacct gttgttatgt cagaattaac tgatcaaatg 1020 

actttcatgg agagggtaaa aaatatgatc tatgtgcttt actttgactt ttggttcgaa 1080 

atatttgaca tgaagaagtg ggatcagttt tatagtgaag ttctaggtaa gtattttttt 1140 

caatcagtaa catgaagctc taacttattt gtgtctttga agcagagctt atataaagcc 1200 

ataaagtcag ggtagtgggg ttttggtaag tgaatttata aaacaaaaat acaagatgat 1260 

ctattaatct cacaaatatt atagaaaagc ttaaattaca gggtcagtta aaaccctgtg 1320 

gccatcactc acacagaaca ccccaggaaa tcataaacct atacattagt gcatctaaga 1380 

ctttaagcaa ttacacatct gttttactat acattgtttt acatcttaaa aacagtaaaa 1440 

tccatcaaat aacttcttac tgaatgcata gatttagaat gagtagttac acatttttct 1500 

acaactatct atataactgc agaaattgtt ttttcttgta aacttgtttt cttatttaga 1560 

aatcaaaaga tgttcccata ttaccagaag gtttccttca cagtaaagag agataatgtc 1620

tatacctcag atgcaaaaat caataagggc aatttgaagt ttctaatgtt tctatactct 1680 

tgcagg 1686 

<210> 42 
<211> 1340 
<212> DNA 
<213> H. sapiens

<220>

<221> exon 

<222> (668)...(816)

<400> 42 

atagtttttg gaactaggcc cctttattag aacatatgag acaattaagg tggagtacaa 60 

tttttatttc ataatttctc aaaaatttct agctataatg tacaaatata tttacttaaa 120 

aatattatta agatcttagc ttgaatctaa aagagtagtt ggtacaagga tttcagccat 180 

actctcaaca tagtccacag ttcacttgaa ccaaagataa aagaattagc ttaatgagtt 240 

gtgtaaacta gactatttct tagaaaatta tttttatggg tagagtagaa ttaattgatt 300 

atggagctca aagagttgtt taaatgtccg tatgctacta ttgaagcttt aagagaaaag 360 

aaattttatg tttaactttc tatggctcat tttaataatt gtttatgatt atgagcatac 420 

tgatgcgaca ttagagatgt agcttaacct cacaattctc ctactacttt gtctttctta 480 

taaatacaca tgggcaaaat atgtaataca taaaattaaa ttatatctat atatgaatat 540 

gtgtatatat ttttcaaagc acagatattt gcctacattt ttgcctacat tattctaacc 600 

cctttcagaa atttacctaa agtaattatc ttgtgtcatc cacctttttt ttttctattc 660 

ctgtcaggaa gacccactac attatctgag acaatgggga aagctgacgt atggcttatt 720 

cgaaactcct ggaattttca gtttccatat ccactcttac caaatgttga ttttgttgga 780 

ggactccact gcaaacctgc caaacccctg cctaaggtaa acatactttt gttggtttta 840 

ttttgttggc tttgaatttt cagtagaaat gattctatag tcttctttca gagtgtttga 900 

cttacactga aagaaagatg ggaaatgggt ggggtaaagc agataccaat tagaaactca 960 

tgtgcacgtt aataccatca cacgtatatg agttttatga gtattacaaa tagagaggaa 1020 

tactaaggag actttgaaaa tagggttggt taaattaaag tcttcattat gcaataccta 1080 

agaaggtatt ggtcatccaa tcaaataata tttacaaagg gattagcaca aaacacaggt 1140 

aagtgcagaa ttttcagaga aaaaaataga cacagtttct gtccccacat accttacatt 1200 

ctacttcaaa agatagaata tgtgcaagta ataaaaatta tataaaaact attatctgaa 1260 

ggaaaaacgc aataccaaga aagcatcagt ggagataata gaaagtatcc tgcagtcact 1320 

gattagtaag atgggtaccg 1340

<210> 43 
<211> 1822 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (732)... (863) 

<400> 43 

tatatacaat gtctgtatga taaatgagac tcctggcact aattcataga aattccaaat 60 

tacattacca gactccagaa tgtcagcggt tcttaaccac cagcttttat ttattttatt 120 

ttttttagtt tttgaaaaac taccagaaaa ctctgaacaa actttaagtg aagtataaag 180

cattgtagag aaacataaat gtagatataa aattatccca actgtgagta gcttatcctc 240 

agagctcata gttagggaag taaaccacta actgtttcca actaagagaa ttctacagaa 300 

aacctgcctg aaataaacac aagggattta gtagaacaac aatataggat taaagctgag 360 

tggtcccact ttccaagaac ctatattagt aactttagta atgaaagtga agagtcgtgt 420 

attaatattt ttaacattat ctccctgaca acaatgtaat agctccattt cttttctccc 480 

ttacacacat gcacacaaat acatacacat acacacatat ttacacaaat atccttaaca 540 

gcatccacct atctcatatt atacatctac ttgcaaaaaa actgagtgat tgggtcagtt 600 

aaaaaatatt atttactcca ataattcctc aaaatactgg attttctctc tttagtaatt 660 

tgcaccaatt cttttggtag tgcccgctgt gctaatactc ttttgtgatg aagcaaattc 720 

tttcttcaca ggaaatggaa gactttgtac agagctctgg agaaaatggt gttgtggtgt 780 

tttctctggg gtcaatggtc agtaacatga cagaagaaag ggccaacgta attgcatcag 840 

ccctggccca gatcccacaa aaggtaagat gaagtgcctt actggtgtgg aaaactactg 900 

aaagaggctg ttaaagtttg aagtaatcca attatagaaa cttctgataa atgtgaagtt 960 

gaccaaaagt tgaaaaatta gaacaaggat aatcttggag aaactatgag aagtttgaaa 1020 

attgtggttg catttttttt taaatggtgt taagtatgaa cattccccta tgtaaatatg 1080

ctgacaataa attgaatgga gaaaggtatt taaaaagtgt ttggagactt ctcacctcct 1140 

gtccataaaa ttttgaattg tgtatgtgat ctacatagga aaggatatta aagagtagat 1200 

tgaactcttc catagctgaa tatagcctta aatatgcttg tatagcatcc accgacagaa 1260 

gtaatagttg tgcctcagac ttaggggttg catgtggccc tggaggagtt actacccttg 1320 

gtatgcatga gtagttccta ttagcatcag tgggaactca gtactccata tgtattcaca 1380

aaaggcaact tgagacccac agttattttt aatttctgat attaacactc atacatactg 1440

ctgaatttaa ctcaatatat ttcagttaag tgaaaatggt gcttaatgta gtctttagaa 1500 

tgactttcag gtgttttcac aaaaaacgta tatccagaac tgtgtccttt tagaaataca 1560 

agtaaaattt ttgataatta gcttcaaaac agttttccta atctcagcag tatccaatga 1620 

gtgaagaaca cttgactgac tcttgggtca cctctattac ttattgtact ctggaagctc 1680 

ttggtgaatg tttacgatta tgggatgtag tatttctgtt tgcactttaa gtcaaatgct 1740 

tgtataaaat acgtgacaac aaatggagaa tattggctct gttagtagtt atgcggtata 1800 
ttctctgttt aaggatcttt gg 1822

<210> 44 
<211> 1591 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (138)... (225) 

<221> exon 

<222> (1067) ... (1286) 
<400> 44 

attctattta cattagcctt tgagtagttc ttatttacta acatcccttg atctcattcc 60 

tactctttat acagttctca cattctataa cttttgaatt ccactcatgg aataaaatat 120 

tttctttatt gtaacaggtt ctgtggagat ttgatgggaa taaaccagat accttaggtc 180

tcaatactcg gctctacaag tggatacccc agaatgacct tctaggtaag actctggtga 240 

acaaatactg aatatattag taacagcaca ttagagtgtt aatagttcat catgaaacaa 300 

gcttattgaa tatttgttaa ggaaaaacaa aatgtaactt ctttatattg attttccagt 360 

cttaagggag aaagaataca ttataatttt tggcatttta tgatatacac ccacattctt 420 

tatagtctga atcgggggaa tctttatttc aggtgttatt atatctcaca aaatttttca 480 

ataacttcct gggctgtctc tctgtctcct atttctacaa ctttacacct gtttttttcc 540 

tctcccgcag ggttatttga aatgccacta aaaataatag ctcttctatc accagtgact 600 

ctgtattttc tgaagaatta aactgctaat cttaatcata cagtgatgat acatttcacg 660 

atgaagtgtg acctgtcctt cctcaatcct agcaccacca ccaaaccact gcctgctgcc 720 

ttgcccaccc catatatcac actctgtgac tgtcacttaa aataagagtt cacttcatgc 780 

ctatctcttt gctgtcttct tttttgcaca tttttgaaat ctagaatgca atttttcatt 840

agcccaactg gaaatcttgt attgttttgc agtctgaagt cacacacacc gtatagcctt 900 

cagttacata cccagtacaa gtacgtgttt tttcctccga agtctgaaac acaattttaa 960 

tttagttcag tgttttagct ggaaaacact gtcactttca gagcctttca ttgtgcatct 1020 

cattttattc ctatgagtaa ttttgctaaa attcatccaa tcctaggtca tccaaagacc 1080 

agagctttta taactcatgg tggagccaat ggcatctacg aggcaatcta ccatgggatc 1140 

cctatggtgg ggattccatt gtttgccgat caacctgata acattgctca catgaaggcc 1200 

aggggagcag ctgttagagt ggacttcaac acaatgtcga gtacagactt gctgaatgca 1260 

ttgaagagag taattaatga tccttcgtga gtagaacaat atttttcact aggtggtatt 1320 

tacagatagc ttctcttgtc aatagtgagt gtgagtttca tcctttttat aagagactaa 1380 

ttttgaaaga atttaatgat ttaaccaatc tgaaatctgc ttttattttt ataagttatt 1440 

taaaaattga atttgaaaca catacatcta aagaatagcc agttagtgaa acaattttct 1500 

acacaaaaat aattttaaaa ggatatagat aatacaaaaa atacatttct taaaaatttg 1560 
acataattaa tccatagaag aaaggaagaa t 1591 

<210> 45 
<211> 596 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (19)... (549) 

<400> 45 

ctttattttt atctttcaga tataaagaga atgttatgaa attatcaaga attcaacatg 60 

atcaaccagt gaagcccctg gatcgagcag tcttctggat tgaatttgtc atgcgccaca 120

aaggagctaa acaccttcgg gttgcagccc acgacctcac ctggttccag taccactctt 180

tggatgtgat tgggttcctg ctggtctgtg tggcaactgt gatatttatc gtcacaaaat 240 

gttgtctgtt ttgtttctgg aagtttgcta gaaaagcaaa gaagggaaaa aatgattagt 300 

tatatctgag atttgaagct ggaaaacctg ataggtgaga ctacttcagt ttattccagc 360 

aagaaagatt gtgatgcaag atttctttct tcctgagaca aaaaaaaaaa aagaaaaaaa 420 

aatcttttca aaatttactt tgtcaaataa aaatttgttt ttcagagatt taccacccag 480 

ttcatggtta gaaatatttt gtggcaatga agaaaacact acggaaaata aaaaataaga 540 

taaagcctta tgagctcgta ttgaaatttg ttgaacttat atcgcggatc ctactg 596 

<210> 46 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 46 

cttggctaat ttatctttgg 20 

<210> 47 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 47 

cccactaccc tgactttat 19 

<210> 48 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 48 

ggacataacc atgagaaatg 20 

<210> 49 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 49 

agctctgctt caaagacac 19 

<210> 50 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 50 

tgtccgtatg ctactattga a 21 

<210> 51 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 51 

tgtgctaatc cctttgtaaa t 21 

<210> 52 
<211> 22 
<212> DNA 
<213> H. sapiens

<400> 52

tttttttttc tattcctgtc ag 22 

<210> 53 
<211> 17 
<212> DNA 
<213> H. sapiens 

<400> 53 

ctttacccca cccattt 17 

<210> 54 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 54

cccttgatct cattcctact 20 

<210> 55
<211> 24
<212> DNA
<213> H. sapiens

<400> 55

aactggctat tctttagatg tatg 24

<210> 56
<211> 25
<212> DNA
<213> H. sapiens

<400> 56

cattcctact ctttatacag ttctc 25

<210> 57
<211> 17
<212> DNA
<213> H. sapiens

<400> 57

cccccgattc agactat 17

<210> 58
<211> 20
<212> DNA
<213> H. sapiens

<400> 58

cccttgatct cattcctact 20

<210> 59
<211> 24
<212> DNA
<213> H. sapiens

<400> 59

aactggctat tctttagatg tatg 24

<210> 60
<211> 18
<212> DNA
<213> H. sapiens

<400> 60

tcctccgaag tctgaaac 18

<210> 61
<211> 23
<212> DNA
<213> H. sapiens

<400> 61

tataaaaagg atgaaactca cac 23

<210> 62
<211> 18
<212> DNA
<213> H. sapiens

<400> 62

caagccccca agttatgt 18

<210> 63
<211> 20
<212> DNA
<213> H. sapiens

<400> 63

cagtaggatc cgcgatataa 20

<210> 64
<211> 20
<212> DNA
<213> H. sapiens

<400> 64

tctgaggggt tttgtctgta 20

<210> 65
<211> 20
<212> DNA
<213> H. sapiens

<400> 65

ccgcgatata agttcaacaa 20

<210> 66
<211> 20
<212> DNA
<213> H. sapiens

<400> 66

ggacataacc atgagaaatg 20

<210> 67
<211> 19
<212> DNA
<213> H. sapiens

<400> 67

ttaagagcgg atgagttgt 19

<210> 68
<211> 20
<212> DNA
<213> H. sapiens

<400> 68

tcatcatgca acagattaag 20



<210> 69 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 69

cactacaggg aaaaatagca 20 

<210> 70 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 70 

accctttgtg tacagtctca 20 

<210> 71
<211> 19
<212> DNA
<213> H. sapiens

<400> 71

agctctgctt caaagacac 19

<210> 72
<211> 21
<212> DNA
<213> H. sapiens

<400> 72

ttgcctacat tattctaacc c 21

<210> 73
<211> 17
<212> DNA
<213> H. sapiens

<400> 73

ctttacccca cccattt 17

<210> 74
<211> 25
<212> DNA
<213> H. sapiens

<400> 74

cattcctact ctttatacag ttctc 25

<210> 75
<211> 17
<212> DNA
<213> H. sapiens

<400> 75

cccccgattc agactat 17

<210> 76
<211> 25 
<212> DNA 
<213> H. sapiens 

<400> 76 
cattcctact ctttatacag ttctc 

<210> 77 
<211> 17 
<212> DNA 
<213> H. sapiens 

<400> 77 
cccccgattc agactat 

<210> 78 
<211> 18
<212> DNA 
<213> H. sapiens 

<400> 78 
tcctccgaag tctgaaac 

<210> 79 
<211> 23 
<212> DNA 
<213> H. sapiens 

<400> 79 
tataaaaagg atgaaactca cac 

<210> 80 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 80 
tctgaggggt tttgtctgta 

<210> 81 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 81 
ttttttgtct caggaagaaa ga 

<210> 82 
<211> 24 
<212> DNA 
<213> H. sapiens 

<400> 82 
aaaaaaagaa aaaaaaatct tttc 

<210> 83 
<211> 20 
<212> DNA 
<213> H. sapiens

<400> 83
ccgcgatata agttcaacaa 

<210> 84 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 84 
tgcattgcac caggatgtct gt 

<210> 85 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 85 
gcattgcacc aagatgtctg t 

<210> 86 
<211> 23 
<212> DNA 
<213> H. sapiens 

<400> 86 
tcctggatga gcttattcag aga 

<210> 87 
<211> 23 
<212> DNA 
<213> H. sapiens 

<400> 87 
tcctggatga gcctattcag aga 

<210> 88 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 88 
cattttggtt atatttttca c 

<210> 89 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 89
cattttggtt ttatttttca c 

<210> 90 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 90 
cataactaga aagttctgta a 

<210> 91 
<211> 21 
<212> DNA
<213> H. sapiens

<400> 91
cataactagg aagttctgta a

<210> 92 
<211> 20 
<212> DNA 
<213> H. sapiens

<400> 92
cctggctaca cttttgaaaa 

<210> 93 

<211> 20 

<212> DNA 

<213> H. sapiens 

<400> 93 
cctggctaca tttttgaaaa 

<210> 94 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 94
gaagacccac tacattatct g 

<210> 95 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 95 
gaagacccac tacgttatct g 

<210> 96 
<211> 26 
<212> DNA 
<213> H. sapiens 

<400> 96 
aattttcagt ttccatatcc actctt 

<210> 97 
<211> 26 
<212> DNA 
<213> H. sapiens 

<400> 97 
aattttcagt ttcctcatcc actctt 

<210> 98 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 98
taggtctcaa tactcggctc ta 

<210> 99
<211> 22

<212> DNA 

<213> H. sapiens 

<400> 99 
taggtctcaa tactcggctg ta 

<210> 100 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 100 
tacaagtgga taccccaga 

<210> 101 

<211> 19 

<212> DNA 

<213> H. sapiens 

<400> 101 
tataagtgga taccccaga 

<210> 102 

<211> 26 

<212> DNA 

<213> H. sapiens 

<400> 102 
gggagaaaga atacattata attttt 

<210> 103 
<211> 25 
<212> DNA 
<213> H. sapiens 

<400> 103 
gggagaaaga atacttataa ttttt 

<210> 104 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 104 
ttccattgtt tgccgatcaa c 

<210> 105 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 105 
ttccattgtt tgctgatcaa c 

<210> 106 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 106 
gaatgcattg aagagagtaa t 21

<210> 107
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 107 

gaatgcattg cagagagtaa t 21 

<210> 108 
<211> 22
<212> DNA 
<213> H. sapiens 

<400> 108 

ctggtctgtg tggcaactgt ga 22 

<210> 109 

<211> 22 

<212> DNA 

<213> H. sapiens 

<400> 109 

ctggtctgtg tggcgactgt ga 22 

<210> 110 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 110 

taagataaag ccttatgag 19 

<210> 111 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 111 

taagataaag acttatgag 19 

<210> 112 
<211> 1976 
<212> DNA 
<213> H. sapiens

<220> 
<221> CDS 

<222> (11)...(1598)
<400> 112 

taagaccagg atg tct ctg aaa tgg acg tca gtc ttt ctg ctg ata cag 49
Met Ser Leu Lys Trp Thr Ser Val Phe Leu Leu Ile Gln
1 5 10

ctc agt tgt tac ttt agc tct gga agc tgt gga aag gtg cta gtg tgg 97
Leu Ser Cys Tyr Phe Ser Ser Gly Ser Cys Gly Lys Val Leu Val Trp
15 20 25

ccc aca gaa tac agc cat tgg ata aat atg aag aca atc ctg gaa gag 145
Pro Thr Glu Tyr Ser His Trp Ile Asn Met Lys Thr Ile Leu Glu Glu
30 35 40 45

ctt gtt cag agg ggt cat gag gtg act gtg ttg aca tct tcg gct tct 193
Leu Val Gln Arg Gly His Glu Val Thr Val Leu Thr Ser Ser Ala Ser
50 55 60

act ctt gtc aat gcc agt aaa tca tct gct att aaa tta gaa gtt tat 241
Thr Leu Val Asn Ala Ser Lys Ser Ser Ala Ile Lys Leu Glu Val Tyr
65 70 75

cct aca tct tta act aaa aat gat ttg gaa gat tct ctt ctg aaa att 289
Pro Thr Ser Leu Thr Lys Asn Asp Leu Glu Asp Ser Leu Leu Lys Ile
80 85 90

ctc gat aga tgg ata tat ggt gtt tca aaa aat aca ttt tgg tca tat 337
Leu Asp Arg Trp Ile Tyr Gly Val Ser Lys Asn Thr Phe Trp Ser Tyr
95 100 105

ttt tca caa tta caa gaa ttg tgt tgg gaa tat tat gac tac agt aac 385
Phe Ser Gln Leu Gln Glu Leu Cys Trp Glu Tyr Tyr Asp Tyr Ser Asn
110 115 120 125

aag ctc tgt aaa gat gca gtt ttg aat aag aaa ctt atg atg aaa cta 433
Lys Leu Cys Lys Asp Ala Val Leu Asn Lys Lys Leu Met Met Lys Leu
130 135 140

caa gag tca aag ttt gat gtc att ctg gca gat gcc ctt aat ccc tgt 481
Gln Glu Ser Lys Phe Asp Val Ile Leu Ala Asp Ala Leu Asn Pro Cys
145 150 155

ggt gag cta ctg gct gaa cta ttt aac ata ccc ttt ctg tac agt ctt 529
Gly Glu Leu Leu Ala Glu Leu Phe Asn Ile Pro Phe Leu Tyr Ser Leu
160 165 170

cga ttc tct gtt ggc tac aca ttt gag aag aat ggt gga gga ttt ctg 577
Arg Phe Ser Val Gly Tyr Thr Phe Glu Lys Asn Gly Gly Gly Phe Leu
175 180 185

ttc cct cct tcc tat gta cct gtt gtt atg tca gaa tta agt gat caa 625
Phe Pro Pro Ser Tyr Val Pro Val Val Met Ser Glu Leu Ser Asp Gln
190 195 200 205

atg att ttc atg gag agg ata aaa aat atg ata cat atg ctt tat ttt 673
Met Ile Phe Met Glu Arg Ile Lys Asn Met Ile His Met Leu Tyr Phe
210 215 220

gac ttt tgg ttt caa att tat gat ctg aag aag tgg gac cag ttt tat 721
Asp Phe Trp Phe Gln Ile Tyr Asp Leu Lys Lys Trp Asp Gln Phe Tyr
225 230 235

agt gaa gtt cta gga aga ccc act aca tta ttt gag aca atg ggg aaa 769
Ser Glu Val Leu Gly Arg Pro Thr Thr Leu Phe Glu Thr Met Gly Lys
240 245 250

gct gaa atg tgg ctc att cga acc tat tgg gat ttt gaa ttt cct cgc 817
Ala Glu Met Trp Leu Ile Arg Thr Tyr Trp Asp Phe Glu Phe Pro Arg
255 260 265

cca ttc tta cca aat gtt gat ttt gtt gga gga ctt cac tgt aaa cca 865
Pro Phe Leu Pro Asn Val Asp Phe Val Gly Gly Leu His Cys Lys Pro
270 275 280 285

gcc aaa ccc ctg cct aag gaa atg gaa gag ttt gtg cag agc tct gga 913
Ala Lys Pro Leu Pro Lys Glu Met Glu Glu Phe Val Gln Ser Ser Gly
290 295 300

gaa aat ggt att gtg gtg ttt tct ctg ggg tcg atg atc agt aac atg 961
Glu Asn Gly Ile Val Val Phe Ser Leu Gly Ser Met Ile Ser Asn Met
305 310 315

tca gaa gaa agt gcc aac atg att gca tca gcc ctt gcc cag atc cca 1009
Ser Glu Glu Ser Ala Asn Met Ile Ala Ser Ala Leu Ala Gln Ile Pro
320 325 330

caa aag gtt cta tgg aga ttt gat ggc aag aag cca aat act tta ggt 1057
Gln Lys Val Leu Trp Arg Phe Asp Gly Lys Lys Pro Asn Thr Leu Gly
335 340 345

tcc aat act cga ctg tac aag tgg tta ccc cag aat gac ctt ctt ggt 1105
Ser Asn Thr Arg Leu Tyr Lys Trp Leu Pro Gln Asn Asp Leu Leu Gly
350 355 360 365

cat ccc aaa acc aaa gct ttt ata act cat ggt gga acc aat ggc atc 1153
His Pro Lys Thr Lys Ala Phe Ile Thr His Gly Gly Thr Asn Gly Ile
370 375 380

tat gag gcg atc tac cat ggg atc cct atg gtg ggc att ccc ttg ttt 1201
Tyr Glu Ala Ile Tyr His Gly Ile Pro Met Val Gly Ile Pro Leu Phe
385 390 395

gcg gat caa cat gat aac att gct cac atg aaa gcc aag gga gca gcc 1249
Ala Asp Gln His Asp Asn Ile Ala His Met Lys Ala Lys Gly Ala Ala
400 405 410

ctc agt gtg gac atc agg acc atg tca agt aga gat ttg ctc aat gca 1297
Leu Ser Val Asp Ile Arg Thr Met Ser Ser Arg Asp Leu Leu Asn Ala
415 420 425

ttg aag tca gtc att aat gac cct gtc tat aaa gag aat gtc atg aaa 1345
Leu Lys Ser Val Ile Asn Asp Pro Val Tyr Lys Glu Asn Val Met Lys
430 435 440 445

tta tca aga att cat cat gac caa cca atg aag ccc ctg gat cga gca 1393
Leu Ser Arg Ile His His Asp Gln Pro Met Lys Pro Leu Asp Arg Ala
450 455 460

gtc ttc tgg att gag ttt gtc atg cgc cac aaa gga gcc aag cac ctt 1441
Val Phe Trp Ile Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu
465 470 475

cga gtc gca gct cac aac ctc acc tgg atc cag tac cac tct ttg gat 1489
Arg Val Ala Ala His Asn Leu Thr Trp Ile Gln Tyr His Ser Leu Asp
480 485 490

gtg ata gca ttc ctg ctg gcc tgc gtg gca act gtg ata ttt atc atc 1537
Val Ile Ala Phe Leu Leu Ala Cys Val Ala Thr Val Ile Phe Ile Ile
495 500 505

aca aaa ttt tgc ctg ttt tgt ttc cga aag ctt gcc aaa aca gga aag 1585
Thr Lys Phe Cys Leu Phe Cys Phe Arg Lys Leu Ala Lys Thr Gly Lys
510 515 520 525

aag aag aaa aga g attagttata tcaaaagcct gaagtggaat gactgaaaga 1638
Lys Lys Lys Arg

tgggactcct cctttatttc agcatggagg gttttaaatg gaggatttcc tttttcctgt 1698

gacaaaacat cttttcacta cttaccttgt taagacaaaa tttattttcc agggatttaa 1758 

tacgtacttt agttggaatt attctatgtc aatgattttt aagctatgaa aaatacaatg 1818 

gggggaagga tagcatttgg agatatacct aatgttaaat gacgagttac tggatgcagc 1878 

acgccaacat ggcacatgta tacatatgta gctaacctca cgttgtgcac atgtacccta 1938 

aaacttaaag tataatttaa aaaaagcaaa gggtaccg 1976 

<210> 113 
<211> 530 
<212> PRT 
<213> H. sapiens 



<400> 113

Met Ser Leu Lys Trp Thr Ser Val Phe Leu Leu Ile Gln Leu Ser Cys
1 5 10 15

Tyr Phe Ser Ser Gly Ser Cys Gly Lys Val Leu Val Trp Pro Thr Glu
20 25 30

Tyr Ser His Trp Ile Asn Met Lys Thr Ile Leu Glu Glu Leu Val Gln
35 40 45

Arg Gly His Glu Val Thr Val Leu Thr Ser Ser Ala Ser Thr Leu Val
50 55 60

Asn Ala Ser Lys Ser Ser Ala Ile Lys Leu Glu Val Tyr Pro Thr Ser
65 70 75 80

Leu Thr Lys Asn Asp Leu Glu Asp Ser Leu Leu Lys Ile Leu Asp Arg
85 90 95

Trp Ile Tyr Gly Val Ser Lys Asn Thr Phe Trp Ser Tyr Phe Ser Gln
100 105 110

Leu Gln Glu Leu Cys Trp Glu Tyr Tyr Asp Tyr Ser Asn Lys Leu Cys
115 120 125

Lys Asp Ala Val Leu Asn Lys Lys Leu Met Met Lys Leu Gln Glu Ser
130 135 140

Lys Phe Asp Val Ile Leu Ala Asp Ala Leu Asn Pro Cys Gly Glu Leu
145 150 155 160

Leu Ala Glu Leu Phe Asn Ile Pro Phe Leu Tyr Ser Leu Arg Phe Ser
165 170 175

Val Gly Tyr Thr Phe Glu Lys Asn Gly Gly Gly Phe Leu Phe Pro Pro
180 185 190

Ser Tyr Val Pro Val Val Met Ser Glu Leu Ser Asp Gln Met Ile Phe
195 200 205

Met Glu Arg Ile Lys Asn Met Ile His Met Leu Tyr Phe Asp Phe Trp
210 215 220

Phe Gln Ile Tyr Asp Leu Lys Lys Trp Asp Gln Phe Tyr Ser Glu Val
225 230 235 240

Leu Gly Arg Pro Thr Thr Leu Phe Glu Thr Met Gly Lys Ala Glu Met
245 250 255

Trp Leu Ile Arg Thr Tyr Trp Asp Phe Glu Phe Pro Arg Pro Phe Leu
260 265 270

Pro Asn Val Asp Phe Val Gly Gly Leu His Cys Lys Pro Ala Lys Pro
275 280 285

Leu Pro Lys Glu Met Glu Glu Phe Val Gln Ser Ser Gly Glu Asn Gly
290 295 300

Ile Val Val Phe Ser Leu Gly Ser Met Ile Ser Asn Met Ser Glu Glu
305 310 315 320

Ser Ala Asn Met Ile Ala Ser Ala Leu Ala Gln Ile Pro Gln Lys Val
325 330 335

Leu Trp Arg Phe Asp Gly Lys Lys Pro Asn Thr Leu Gly Ser Asn Thr
340 345 350

Arg Leu Tyr Lys Trp Leu Pro Gln Asn Asp Leu Leu Gly His Pro Lys
355 360 365

Thr Lys Ala Phe Ile Thr His Gly Gly Thr Asn Gly Ile Tyr Glu Ala
370 375 380

Ile Tyr His Gly Ile Pro Met Val Gly Ile Pro Leu Phe Ala Asp Gln
385 390 395 400

His Asp Asn Ile Ala His Met Lys Ala Lys Gly Ala Ala Leu Ser Val
405 410 415

Asp Ile Arg Thr Met Ser Ser Arg Asp Leu Leu Asn Ala Leu Lys Ser
420 425 430

Val Ile Asn Asp Pro Val Tyr Lys Glu Asn Val Met Lys Leu Ser Arg
435 440 445

Ile His His Asp Gln Pro Met Lys Pro Leu Asp Arg Ala Val Phe Trp
450 455 460

Ile Glu Phe Val Met Arg His Lys Gly Ala Lys His Leu Arg Val Ala
465 470 475 480

Ala His Asn Leu Thr Trp Ile Gln Tyr His Ser Leu Asp Val Ile Ala
485 490 495

Phe Leu Leu Ala Cys Val Ala Thr Val Ile Phe Ile Ile Thr Lys Phe
500 505 510

Cys Leu Phe Cys Phe Arg Lys Leu Ala Lys Thr Gly Lys Lys Lys Lys
515 520 525

Arg Asp
530

<210> 114 
<211> 2312 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (692)...(1425)

<400> 114 

accctcctgc tcccatctgc catgatcact ggaaaaccct catttatttt ttaaagggtc 60 

cagaaaatgc taatctatag agatagaaat tagattagtg gttgcctagg gtaggatgga 120 

tgcaaaattt cagagtgggg ggttagaggc tattgtatag aatcttttgg agataatact 180 

gattattgta gtgaaagtaa aattctgtga atatactagg aaacattgaa ctgtacacac 240 

taattggtga gtcatatggt atatgaatta tgtgtcaaca aagttttaga agacattact 300 

tgcaccacga tattaaaaaa tgccgtttga gttgtataat tacttcttct ctctatgtca 360 

agggcaccga acaggcagga gcctctcact tgccactgtt cttaacagta ttataaaata 420 

attacataag acaggttact tacatattct aggtcataaa aattattgct tgactagagt 480 

aattgtaaac ataaaagaac accaaacaca ctaaaataaa tatgaggtca tcaatctttt 540 

gttggtctcc ttggcatgca cctattcaga ctgttagtat tatgtattta cttcaaattt 600 

tagcagttat attttaactt gattgatttt tcctcagata taagtatgag aaatgacaga 660 

aagaaacaac aactggaaaa gaagcattgc ataagaccag gatgtctctg aaatggacgt 720 

cagtctttct gctgatacag ctcagttgtt actttagctc tggaagctgt ggaaaggtgc 780 

tagtgtggcc cacagaatac agccattgga taaatatgaa gacaatcctg gaagagcttg 840 

ttcagagggg tcatgaggtg actgtgttga catcttcggc ttctactctt gtcaatgcca 900 

gtaaatcatc tgctattaaa ttagaagttt atcctacatc tttaactaaa aatgatttgg 960 

aagattctct tctgaaaatt ctcgatagat ggatatatgg tgtttcaaaa aatacatttt 1020 

ggtcatattt ttcacaatta caagaattgt gttgggaata ttatgactac agtaacaagc 1080 

tctgtaaaga tgcagttttg aataagaaac ttatgatgaa actacaagag tcaaagtttg 1140 

atgtcattct ggcagatgcc cttaatccct gtggtgagct actggctgaa ctatttaaca 1200 

taccctttct gtacagtctt cgattctctg ttggctacac atttgagaag aatggtggag 1260 

gatttctgtt ccctccttcc tatgtacctg ttgttatgtc agaattaagt gatcaaatga 1320 

ttttcatgga gaggataaaa aatatgatac atatgcttta ttttgacttt tggtttcaaa 1380

tttatgatct gaagaagtgg gaccagtttt atagtgaagt tctaggtaag tcatgtgtct 1440 

aactggtgct tattaagttc taacttttct gtgcctttga aggtgagctt atataaatat 1500 

aatgtcagaa gatagtgttt ttaagggaaa ttatgaattg caaatgtaag atgatctatc 1560 

agtctcaaaa atattataga atgttgacct tatagaatca gttagaaccc tggggccatc 1620 

actactacag gacacccaga gagtcataaa ccttcattgt aaagcactaa tgatttcttt 1680 

aaactatcac atatcatttt gctatacatt ttttcatctt taaaaaaagt caatagatac 1740 

ctcaagaaac atcttcatga aggcagacac ataaatttag tatttacaca tatttctaga 1800 

aaaattatca atgcaggatt gaggaatttg tttctctttg agttcctcag tttcctcatt 1860

tagaaattaa attttgtttt tcatgtaaga aggattcctt cacagttgag taatatagtg 1920

gctctactcc agaaacagaa gcctaaaact tgagatttct aatgtttata cattccttca 1980 

ataacaggtt gacaattatt tctttcaaaa actgaaatct tgxtgaaagt gaacatctaa 2040 

gttttaatct atattttatt aaactgcatc tctccatcaa agaaaatagg ggccaaatta 2100 

agggagagca catatctcta tgtcaataaa ttctgaaaat gttttaattc tcatttgtaa 2160 

atatatttat tttaaaaatc taattatatt aagatcttac gatgaaccaa gacagtagta 2220 

ggtgtaaaga tttcagtgtt gagctcaaaa aactcatggt ttactttgag aaccaaggat 2280

caagggctag cttaataaac tgtagacact ag 2312 

<210> 115 
<211> 1021 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (413) ... (565) 

<400> 115

accatgatcc aatcacctgc cactgggtcc ctccctggac acatggggat tatggggatt 60 

ataattcaag atgagaggag atttgggtgg ggacagtcaa accatattag tgacttattt 120 

taataattat ttatgattgt gaatatactg atgttacatt aaagatgtga tttcttctta 180 

cagatctctg aatacattgc cttccttata tatacatatg agcaacatat gcaataaata 240 

aaatctaaat tatgactata tataaatgta tttatatata ttttatcaat gcacagacat 300 

tttatatatg tttgggtatg ttattccaag tcctttcagg aaaatacctg catattcaaa 360 

taacaattct cgtgttagct accttttgtt ttgttttgtt tttttccatc aggaagaccc 420 

actacattat ttgagacaat ggggaaagct gaaatgtggc tcattcgaac ctattgggat 480 

tttgaatttc ctcgcccatt cttaccaaat gttgattttg ttggaggact tcactgtaaa 540 

ccagccaaac ccctgcctaa ggtaaatgta ttcttgtttc atttgtttgc ttgacatttt 600 

cagaaggaat ggctggatat gtttctttca gagtgtttaa ctcagagtga ggggaatatg 660 

ggaggtcaaa aacaaggact tgccattaga aaatcatata tttctgtagt atcacaagta 720 

tgtgaatgtt attatcatta aagaccaaag aggtttacta gggagatttt gaaaacaggg 780 

ttggttaaag taaggccttc attgtgccac ccaaaagata gtatgattca tttcttcaaa 840 

aaatatttgt agagtgatta atacaaacca caggtaagtg ctggattttc agagaataaa 900 

ggtagcacag tttctgctcc ctcatgcctt acattgtact ttgaaagata gaataaaaac 960 

aagtgaaaaa gaaaagtcta aaaagtgtta ttaaggaaag accacaatga taaagaaata 1020 

t 1021 

<210> 116 
<211> 480 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (43)...(174) 

<400> 116 

tgctgttgct cttttctgat agaacaaatt ctttcttcac aggaaatgga agagtttgtg 60 

cagagctctg gagaaaatgg tattgtggtg ttttctctgg ggtcgatgat cagtaacatg 120 

tcagaagaaa gtgccaacat gattgcatca gcccttgccc agatcccaca aaaggttaga 180 

taaagtgcct taactgtgga tggctactaa atgaatctgt taaactcttc aagagtccat 240 

tacagaaatg ttctgcctga aaatttaact gctatgatag ttctaattat ctcagacatc 300 

tgttcaaagc aaaaacatat atggaagatc ttaaaatcat aaagagagga gttttggttg 360 

ataataacgt tggcattaat attgtgatca gaaggaaata tatttaagag gtgctagtga 420 

agtttggtat tatcatggta tcgtagcatg tacatagaaa tcactaaatt ctgccctgtc 480 

<210> 117 
<211> 1602 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (368)...(455) 

<221> exon 

<222> (1295)...(1514) 

<400> 117 

tgagtcaagg gctgactttg aatagaatgg gaggtaggtt tgccctaagc agcttaactt 60 

ttccctttag catagagttt gggttgccaa gatttatttt cctttcacaa tctcatgtgt 120 

ctagctatta tgttagaaat gtcattattt ctttatatac aaaattgatt ataaaagtaa 180 

cgacattaaa cgtgggtatt caacttacct caaactttta gtagttctca ttacttgaca 240 

tcacttcttc ttatttcttc atcttttata tggattaact aactgattat taatctcttc 300 

agaattctaa catgctatgt ttttagagtt ctattcattg aacaagatat tttccttgcc 360 

ctaacaggtt ctatggagat ttgatggcaa gaagccaaat actttaggtt ccaatactcg 420 

actgtacaag tggttacccc agaatgacct tcttggtaag attctggaga acaaacagtg 480 

aatatattag taacagcaaa ttggagtgat aatagttcaa cataaaacaa acatatttag 540 

catttattat tggaaaacta aaaaacaaat caaatttaac tactttatat ttattttcca 600 

gtcttagtat aaaaagaatg cactatagta gttggcattt tattacatac agtcacattc 660 

tttatggtca gaataaaaat ctctttgttc aggtgtaatt tcctctcaca ggttttaaat 720 

aacatcctgg attttctgtc tgtctcctat ttatgcagct ttacctctgt tctttcccct 780 

actgcagggt tatttcaaca ggcactgaaa aatagcggac acttttctat taccagtgac 840 

tctacttttt atgggaataa ataaccaatc tttatcatga taaaatgata acacatttca 900 

tgatgatgca taaccggtcc ttcctcagcc ccacctccac cctactccct gctgcctttt 960 

aaaaaaaatt aaatatttta aatattttaa gtatttaaat attttttaaa tatgtaaatg 1020 

tgacctcatt atttataata cttaaaagac cacgttcttg tatacccaat cttattcttt 1080 

ttttttgcac attttaattt tttaattaag aatatgcttt ttcattttgt tcacctggca 1140 

attcttctga aatttgaaaa caatttcaat gcagttttgt gggtataatg ttacctaggg 1200 

aacagttttg ctttaagttc cttatattgt gcatttctta ttcaattctc ataccttgta 1260 

attaataatt ttgttaaaat gcatccactt ttaggtcatc ccaaaaccaa agcttttata 1320 

actcatggtg gaaccaatgg catctatgag gcgatctacc atgggatccc tatggtgggc 1380 

attcccttgt ttgcggatca acatgataac attgctcaca tgaaagccaa gggagcagcc 1440 

ctcagtgtgg acatcaggac catgtcaagt agagatttgc tcaatgcatt gaagtcagtc 1500 

attaatgacc ctgtgtgagt attacagttt tgtgaccagg tggtatttat aaattatttt 1560 

gtcaacagtg aatatgaatt ttaacccgtt tttaagagac ta 1602 

<210> 118 
<211> 978 
<212> DNA 
<213> H. sapiens 

<220> 

<221> exon 

<222> (326)...(978) 

<400> 118 

caaaaagatc attctcaaat tccatttcca ctatcttact tatagcactt agaatggctc 60 

ataatatttt ctgctccaga aaacattaac tttcccaccg aaaattccat ttttcatttt 120 

taaaggtatt tgtcagtgat aaaactccaa tttaaaaacc aaactttctg taatgacatg 180 

aattaaaaca ttgaaatttc atgccaattc agtgacactt actttcaatc atttgtgtga 240 

cacttttcaa agaccatcca tagacttgat atgcttaagc aataaattta cttttaatgt 300 

tgatatcttt atatttatcc ttcagctata aagagaatgt catgaaatta tcaagaattc 360 

atcatgacca accaatgaag cccctggatc gagcagtctt ctggattgag tttgtcatgc 420 

gccacaaagg agccaagcac cttcgagtcg cagctcacaa cctcacctgg atccagtacc 480 

actctttgga tgtgatagca ttcctgctgg cctgcgtggc aactgtgata tttatcatca 540 

caaaattttg cctgttttgt ttccgaaagc ttgccaaaac aggaaagaag aagaaaagag 600 

attagttata tcaaaagcct gaagtggaat gactgaaaga tgggactcct cctttatttc 660 

agcatggagg gttttaaatg gaggatttcc tttttcctgt gacaaaacat cttttcacta 720 

cttaccttgt taagacaaaa tttattttcc agggatttaa tacgtacttt agttggaatt 780 

attctatgtc aatgattttt aagctatgaa aaatacaatg gggggaagga tagcatttgg 840 

agatatacct aatgttaaat gacgagttac tggatgcagc acgccaacat ggcacatgta 900 

tacatatgta gctaacctca cgttgtgcac atgtacccta aaacttaaag tataatttaa 960 

aaaaagcaaa gggtaccg 978 

<210> 119 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 119 
catgcaccta ttcagactgt 

<210> 120 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 120 
tgggtgtcct gtagtagtga 

<210> 121 
<211> 25 
<212> DNA 
<213> H. sapiens 

<400> 121 
attgattttt cctcagatat aagta 

<210> 122 

<211> 22 

<212> DNA 

<213> H. sapiens 

<400> 122 
tcataatttc ccttaaaaac ac 

<210> 123 

<211> 22 

<212> DNA 

<213> H. sapiens 

<400> 123 
atatgtttgg gtatgttatt cc 

<210> 124 

<211> 18 

<212> DNA 

<213> H. sapiens 

<400> 124 
ccatattccc ctcactct 

<210> 125 

<211> 23 

<212> DNA 

<213> H. sapiens 

<400> 125 
atacctgcat attcaaataa caa 

<210> 126 

<211> 18 

<212> DNA 

<213> H. sapiens 

<400> 126 
tatccagcca ttccttct 18 



<210> 127 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 127 

agttttgtgg gtataatgtt ac 22 

<210> 128 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 128 

aaacgggtta aaattcata 19 

<210> 129 
<211> 25 
<212> DNA 
<213> H. sapiens 

<400> 129 

tcataccttg taattaataa ttttg 25 

<210> 130 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 130 

cgggttaaaa ttcatattca 20 

<210> 131 
<211> 18 
<212> DNA 
<213> H. sapiens 

<400> 131 

tcatgccaat tcagtgac 18 

<210> 132 
<211> 17 
<212> DNA 
<213> H. sapiens 

<400> 132 

accctccatg ctgaaat 17 

<210> 133 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 133 

tcaaagacca tccatagact t 21 



<210> 134 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 134 
ggagtcccat ctttcagtc 

<210> 135 
<211> 25 
<212> DNA 
<213> H. sapiens 

<400> 135 
attgattttt cctcagatat aagta 

<210> 136 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 136 
atttactggc attgacaag 

<210> 137 

<211> 25 

<212> DNA 

<213> H. sapiens 

<400> 137 
attgattttt cctcagatat aagta 

<210> 138 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 138 
tgtacagaaa gggtatgtta aa 

<210> 139 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 139 
aaaaatkatt tggaagattc 

<210> 140 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 140 
tcataatttc ccttaaaaac ac 

<210> 141 
<211> 23 
<212> DNA 
<213> H. sapiens 

<400> 141 
atacctgcat attcaaataa caa 

<210> 142 
<211> 18 
<212> DNA 
<213> H. sapiens 

<400> 142 
tatccagcca ttccttct 

<210> 143 
<211> 25 
<212> DNA 
<213> H. sapiens 

<400> 143 
tcataccttg taattaataa ttttg 

<210> 144 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 144 
cgggttaaaa ttcatattca 

<210> 145 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 145 
tcaaagacca tccatagact t 

<210> 146 
<211> 19 
<212> DNA 
<213> H. sapiens 

<400> 146 
ggagtcccat ctttcagtc 

<210> 147 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 147 
tgatacagct cagttgttac 

<210> 148 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 148 
tgatacagct cggttgttac 

<210> 149 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 149 
tgttgacatc ttcggcttct 

<210> 150 
<211> 20 
<212> DNA 
<213> H. sapiens 

<400> 150 
tgttgacatc gtcggcttct 

<210> 151 
<211> 23 
<212> DNA 
<213> H. sapiens 

<400> 151 
ctttaactaa aaatgatttg gaa 

<210> 152 

<211> 23 

<212> DNA 

<213> H. sapiens 

<400> 152 
ctttaactaa aaattatttg gaa 

<210> 153 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 153 
tttaacatac cctttctgta ca 

<210> 154 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 154 
tttaacatac cctttccgta ca 

<210> 155 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 155 
ttggaggact tcactgtaaa cc 

<210> 156 
<211> 22 
<212> DNA 
<213> H. sapiens 

<400> 156 
ttggaggact tcagtgtaaa cc 

<210> 157 
<211> 23 
<212> DNA 
<213> H. sapiens 

<400> 157 
tatgaggcga tctaccatgg gat 

<210> 158 
<211> 23 
<212> DNA 
<213> H. sapiens 

<400> 158 
tatgaggcaa tctaccatgg gat 

<210> 159 
<211> 24 
<212> DNA 
<213> H. sapiens 

<400> 159 
cccttgtttg cggatcaaca tgat 

<210> 160 
<211> 24 
<212> DNA 
<213> H. sapiens 

<400> 160 
cccttgtttg tggatcaaca tgat 

<210> 161 

<211> 22 

<212> DNA 

<213> H. sapiens 

<400> 161 
aaagagaatg tcatgaaatt at 

<210> 162 

<211> 22 

<212> DNA 

<213> H. sapiens 

<400> 162 
aaagagaata tcatgaaatt at 

<210> 163 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 163 
gcttgccaaa acaggaaaga a 

<210> 164 
<211> 21 
<212> DNA 
<213> H. sapiens 

<400> 164 
gcttgccaaa aaaggaaaga a 